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GENE THERAPY FOR CYSTIC FIBROSIS 

Related Applications 

This application is a continuation-in-part application of United States Serial Number 
5 08/1 30,682, filed on October 1, 1993 which is a continuation-in-part application of United 
States Serial Number 07/985,478, filed on December 2, 1992, which is a continuation-in-part 
application of United States Serial Number 07/613,592, filed on November 15, 1990, which 
is in turn a continuation-in-part application of United States Serial Number 07/589,295, filed 
on September 27, 1990, which is itself a continuation-in-part application of United States 

10 Serial Number 07/488,307, filed on March 5, 1990. The contents of all of the above co- 
pending patent applications are incorporated herein by reference. Definitions of language or 
terms not provided in the present application are the same as those set forth in the copending 
applications. Any reagents or materials used in the examples of the present application 
whose source is not expressly identified also is the same as those described in the copending 

15 application, e.g., AF508 CFTR gene and CFTR antibodies. 

Background of flue Invention 

Cystic Fibrosis (CF) is the most common fatal genetic disease in humans (Boat, T.F. 
et al. in The Metabolic Basis of Inherited Diseases (Scriver, C.EL et al. eds., McGraw-Hill, 

20 New York (1 989)). Approximately one in every 2,500 infants in the United States is born 
with the disease. At the present time, there are approximately 30,000 CF patients in the 
United States. Despite current standard therapy, the median age of survival is only 26 years. 
Disease of the pulmonary airways is the major cause of morbidity and is responsible for 95% 
of the mortality. The first manifestation of lung disease is often a cough, followed by 

25 progressive dyspnea. Tenacious sputum becomes purulent because of colonization of 

Staphylococcus and then with Pseudomonas. Chronic bronchitis and bronchiectasis can be 
partially treated with current therapy, but the course is punctuated by increasingly frequent 
exacerbations of the pulmonary disease. As the disease progresses, the patient's activity is 
progressively limited End-stage lung disease is heralded by increasing hypoxemia, 

30 pulmonary hypertension, and cor pulmonale. 

The upper airways of the nose and sinuses are also involved in CF. Most patients 
with CF develop chronic sinusitis. Nasal polyps occur in 15-20% of patients and are 
common by the second decade of life. Gastrointestinal problems are also frequent in CF; 
infants may suffer meconium ileus. Exocrine pancreatic insufficiency, which produces 

35 symp|oms of malabsorption, is present in the large majority of patients with CF. Males arc 
almost uniformly infertile and fertility is decreased in females. 

Based on both genetic and molecular analyses, a gene associated with CF was isolated 
as part of 21 individual cDNA clones and its protein product predicted (Kerem, B.S. et al. 
* (1989) Science 245:1073-1080; Riordan, J.R. et al. (1989) Science 245:1066-1073; 
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Rommens, J.M. ct al. (1989) Science 245:1059-1065)). United States Serial Number 
07/488,307 describes the construction of the gene into a continuous strand, expression of the 
gene as a functional protein and confirmation that mutations of the gene are responsible for 
CF. (See also Gregory, RJ. et al. (1990) Nature 347:382-386; Rich, D.P. et al. (1990) Nature 
5 347:358-362). The co-pending patent application also discloses experiments which show that 
proteins expressed from wild type but not a mutant version of the cDNA complemented the 
defect in the cAMP regulated chloride channel shown previously to be characteristic of CF. 

The protein product of the CF associated gene is called the cystic fibrosis 
transmembrane conductance regulator (CFTR) (Riordan, J.R. et al. (1989) Science 245:1066- 
10 1073). CFTR is a protein of approximately 1480 amino acids made up of two repeated 
elements, each comprising six transmembrane segments and a nucleotide binding domain. 
The two repeats are separated by a large, polar, so-called R-domain containing multiple 
potential phosphorylation sites. Based on its predicted domain structure, CFTR is a member 
of a class of related proteins which includes the multi-drug resistance (MDR) or*P- 
15 glycoprotein, bovine adenyl cyclase, the yeast STE6 protein as well as several bacterial 

amino acid transport proteins (Riordan, J.R. et al. (1989) Science 245:1066-1073; Hyde, S.C. 
et al. (1990) Nature 346:362-365). Proteins in this group, characteristically, are involved in 
pumping molecules into or out of cells. 

CFTR has been postulated to regulate the outward flow of anions from epithelial cells 
20 in response to phosphorylation by cyclic AMP-dependent protein kinase or protein kinase C 
(Riordan, J.R. et al. (1989) Science 245:1066-1073; Welsh, 1986; Frizzell, RA. et al. (1986) 
Science 233:558-560; Welsh, MJ. and Liedtke, CM. (1986) Nature 322:467; Li, M. et al. 
(\9SS)Nature 331:358-360; Huang, T-C. et al. (1989) Science 244:1351-1353). 

Sequence analysis of the CFTR gene of CF chromosomes has revealed a variety'qf 
25 mutations (Cutting, G.R. et al. (1990) Nature 346:366-369; Dean, M. et al. (1990) Cell 

61:863-870; and Kerem, B-S. et al. (1989) Science 245:1073-1080; Kerem, B-S. et al. (199P) 
Proc. Natl Acad Set USA 87:8447-8451). Population studies have indicated that the most 
common CF mutation, a deletion of the 3 nucleotides that encode phenylalanine at position 
508 of the CFTR amino acid sequence (AF508), is associated with approximately 70% of the 
30 cases of cystic fibrosis. This mutation results in the failure of an epithelial cell chloride 
channel to respond to cAMP (Frizzell R.A. et al. (1986) Science 233:558-560; Welsh, M.J. 
. • (1986) Science 232:1648-1650.; Li, M. et al. (X9U)Nature 331:358-360; Quinton, P.M. 
(1989) Clin. Chem. 35:726-730). In airway cells, this leads to an imbalance in ion and fluid 
transport. It is widely believed that this causes abnormal mucus secretion, and ultimately 
35 results in pulmonary infection and epithelial cell damage. 

Studies on the biosynthesis (Cheng, S.H. et al. (1990) Cell 63:827-834; Gregory, RJ. 
etal. (1991)Afo/. Cell Biol. 11:3886-3893) and localization (Denning, G.M. et al. (1992)./ 
Cell Biol. 1 1 8:55 1-559 ) of CFTR AF508, as well as other CFTR mutants, indicate that many 
CFTR mutant proteins are not processed correctly and, as a result, are not delivered to the 
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plasma membrane (Gregory, R.J. etal. ( 1991) Mol Cell Biol 11:3886-3893). These 
conclusions are consistent with earlier functional studies which failed to detect cAMP- 
stimulated CI" channels in cells expressing CFTR AF508 (Rich, D.P. et al. (1990) Nature 
347:358-363; Anderson, M.P. et al. (1991) Science 251:679-682). 
5 To date, the primary objectives of treatment for CF have been to control infection, 

promote mucus clearance, and improve nutrition (Boat, T.F. et al. in The Metabolic Basis of 
Inherited Diseases (Scriver, C.R. et al. eds., McGraw-Hill, New York (1989)). Intensive 
antibiotic use and a program of postural drainage with chest percussion are the mainstays of 
therapy. However, as the disease progresses, frequent hospitalizations are required. 
10 Nutritional regimens include pancreatic enzymes and fat-soluble vitamins. Bnonchodilators 
are used at times. Corticosteroids have been used to reduce inflammation, but they may 
produce significant adverse effects and their benefits are not certain. In extreme cases, lung 
transplantation is sometimes attempted (Marshall, S. et al. (1990) Chest 98:1488). 
Most efforts to develop new therapies for CF have focused on the pulmonary 
1 5 complications. Because CF mucus consists of a high concentration of DNA, derived from 
lysed neutrophils, one approach has been to develop recombinant human DNase (Shak, S. et 
al. (1990) Proc. Natl ScL Acad USA 87:9188). Preliminary reports suggest that aerosolized 
enzyme may be effective in reducing the viscosity of mucus. This could be helpful in 
clearing the airways of obstruction and perhaps in reducing infections. In an attempt to limit 
20 damage caused by an excess of neutrophil derived elastase, protease inhibitors have been 

tested. For example, alpha- 1 -antitrypsin purified from human plasma has been aerosolized to 
deliver enzyme activity to lungs of CF patients (McElvaney, N. etal. (1991) The Lancet 
337:392). Another approach would be the use of agents to inhibit the action of oxidants 
derived from neutrophils. Although biochemical parameters have been successfully 
25 measured, the long term beneficial effects of these treatments have not been established. 

Using a different rationale, other investigators have attempted to use pharmacological 
agents to reverse the abnormally decreased chloride secretion and increased sodium 
absorption in CF airways. Defective electrolyte transport by airway epithelia is thought to 
alter the composition of the respiratory secretions and mucus (Boat, T.F. et al. in The 
30 Metabolic Basis of Inherited Diseases (Scriver, C.R, et al. eds., McGraw-Hill, New York 
(1989); Quinton, P.M. (1990) FASEBJ. 4:2709-2717). Hence, pharmacological treatments 
aimed at correcting the abnormalities in electrolyte transport could be beneficial. Trials are in 
progress with aerosolized versions of the drug amiloride; amiloride is a diuretic that inhibits 
sodium channels, thereby inhibiting sodium absorption. Initial results indicate that the drug 
35 is safe and suggest a slight change in the rate of disease progression, as measured by lung 
function tests (Knowles, M. etal. (1990)//. Eng. J. Med 322: 1189-1194; App, E.(1990)^/w. 
Rev. Respir. Dis. 141 :605). Nucleotides, such as ATP or UTP, stimulate purinergic receptors 
in the airway epithelium. As a result, they open a class of chloride channel that is different 
from CFTWcfiloride channels. In vitro studies indicate that ATP and UTP can stimulate 
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chloride secretion (Knowles, M. et al. (1991) N. Eng. J. Med 325:533). Preliminary trials to 
test the ability of nucleotides to stimulate secretion in vivo, and thereby correct the electrolyte 
transport abnormalities are underway. 

Despite progress in therapy, cystic fibrosis remains a lethal disease, and no current 
5 therapy treats the basic defect. However, two general approaches may prove feasible. These 
are: 1 ) protein replacement therapy to deliver the wild type protein to patients to augment 
their defective protein, and; 2) gene replacement therapy to deliver wild type copies of the CF 
associated gene. Since fee most life threatening manifestations of CF involve pulmonary 
complications, epithelial cells of the upper airways are appropriate target cells for therapy. 

10 The feasibility of gene therapy has been established by introducing a wild type cDNA 

intb epithelial cells from a CF patient and demonstrating complementation of the hallmark 
defect in chloride ion transport (Rich, D.P. et al. (1990) Nature 347:358-363 ). This initial 
work involved cells in tissue culture, however, subsequent work has shown that to deliver the 
gene to the airways of whole animals, defective adenoviruses may be useful (Rosenfeld, 

15 (1992) Cell 68:143-155). However, the safety and effectiveness of using defective 
adenoviruses remain to be demonstrated. 



Summary nf the Invention 

In general, the instant invention relates to vectors for transferring selected genetic 

20 material of interest (e.g., DNA or RNA) to cells in vivo. In preferred embodiments, the 

vectors are adenoviras-based. Advantages of adenovirus-based vectors for gene therapy are 
that they appear to be relatively safe and can be manipulated to encode the desired gene 
product and at the same time are inactivated in terms of their ability to replicate in a normal 
lytic viral life cycle. Additionally, adenovirus has a natural tnopism for airway epithelia. 

25 Therefore, adenovirus-based vectors are particularly preferred for respiratory gene therapy 
applications such as gene therapy for cystic fibrosis. 

In one embodiment, the adenovirus-based gene therapy vector comprises an 
adenovirus 2 serotype genome in which the Ela and Elb regions of the genome, which are 
involved in early stages of viral replication have been deleted and replaced by genetic 

30 material of interest (e.g., DNA encoding the cystic fibrosis transmembrane regulator protein). 

In another embodiment, the adenovirus-based therapy vector is a pseudo-adenovirus 
(PAV). PAVs dratain no potentially harmful viral genes, have a theoretical capacity for 
foreign material of nearly 36 kb, may be produced in reasonatjjy high titers and maintain the 
tropism of the parent adenovirus for dividing and non-dividing human target cell types. 

3 5 PAVs comprise adenovirus inverted terminal repeats and the minimal sequences of a wild- 
type adenovirus type 2 genome necessary for efficient replication and packaging by a helper 
virus and genetic material of interest. In a preferred embodiment, the PAV contains 
adenovirus 2 sequences. 
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In a further embodiment, the adenovirus-based gene therapy vector contains the open 
reading frame 6 (ORF6) of adenoviral early region 4 (E4) from the E4 promoter and is 
deleted for all other E4 open reading frames. Optionally, this vector can include deletions in 
the El and/or E3 regions. Alternatively, the adenovirus-based gene therapy vector contains 
5 the open reading frame 3 (ORF3) of adenoviral E4 from the E4 promoter and is deleted for all 
other E4 open reading frames. Again, optionally, this vector can include deletions in the El 
and/or E3 regions. The deletion of non-essential open reading frames of E4 increases the 
cloning capacity by approximately 2 kb without significantly reducing the viability of the 
virus in cell culture. In combination with deletions in the El and/or E3 regions of adenovirus 
1 0 vectors, the theoretical insert capacity of the resultant vectors is increased to 8-9 kb. 

The invention also relates to methods of gene therapy using the disclosed vectors and 
genetically engineered cells produced by the method. 

Brief Description of the Tables and Drawings 

1 5 Further understanding of the invention may be had by reference to the tables and 

figures wherein; 

Table I shows CFTR mutants wherein the known association with CF (Y, yes or N, 
no), exon localization, domain location and presence (+) or absence (-) of bands A, B, and C 
20 of mutant CFTR species is shown. TM6, indicates transmembrane domain 6; NBD 

nucleotide binding domain; ECD, extracellular domain and Term, termination at 21 codons 
past residue 1337; 

Table II shows the nucleotide sequence of Ad2/CFTR-1 ; 

25 

Table III depicts a nucleotide analysis of Ad2-ORF6/PGK-CFTR; 

The convention for naming mutants is first the amino acid normally found at the 
particular residue, the residue number (Riordan, T.R. et al. (1 989) Science 245: 1066-1073). 
30 and the amino acid to which the residue was converted. The single letter amino acid code is 
used: D, aspartic acid;F 9 phenylalanine; G, glycine; I, isoleucine; K, lysine; M, methionine; 
N, asparagine; Q, glutamine; R, arginine; S, serine; W, tryptophan. Thus G551D is a mutant 
in which glycine 551 is convert»dt6 aspartic acid; 

35 Figure 1 shows alignment of CFTR partial cDNA clones used in construction of 

cDNA containing complete coding sequence of the CFTR, only restriction sites relevant to 
the DNA constructions described below are shown; 



Figure 2 depicts plasmid construction of the CFTR cDNA clone pKK-CFTRl ; 
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Figure 3 depicts plasmid construction of the CFTR cDNA clone pKK-CFTR2; 
Figure 4 depicts plasmid construction of the CFTR cDNA clone pSC-CFTR2; 

5 

Figure 5 shows a plasmid map of the CFTR cDNA clone pSC-CFTR2; 

Figure 6 shows the DNA sequence of synthetic DNAs used for insertion of an intron 
into the CFTR cDNA sequence, with the relevant restriction endonuclease sites and 
1 0 nucleotide positions noted; 

Figures 7 A and 7B depict plasmid construction of the CFTR cDNA clone pKK- 
CFTR3; 

1 5 Figure 8 shows a plasmid map of the CFTR cDNA pKK-CFTR3 containing an intron 

between nucleotides 1716 and 1717; 

Figure 9 shows treatment of CFTR with glycosidases; 

20 Figures 1 OA and 1 OB show an analysis of CFTR expressed from COS-7 transfected 

cells; 

Figures 1 1 A and 1 IB show pulse-chase labeling of wild type and AFS08 mutant 
CFTR in COS-7 transfected cells; 

Figures 12A-12D show immunolocalization of wild type and AF508 mutant CFTR; 
and COS-7 cells transfected with pMT-CFTR or pMT-CFTR-AF508; 

Figure 13 shows an analysis of mutant forms of CFTR; 

30 

Figure 14 shows a map of the first generation adenovirus based vector encoding 
CFTR (Ad2/CFTR- 1 ); * 

Figure 15 shows the plasmid construction of the Ad2/CFTR-1 vector; 

35 

Figure 16 shows an example of UV fluorescence from an agarose gel electrophoresis 
of products of nested RT-PCR from lung homogenates of cotton rats which received 
Ad2/CFTR-1. The gel demonstrates that the homogenates were positive for virally-encoded 
CFTR mRNA; 



WO 94/12649 




PCT/US93/11667 



Figure 17 shows an example of UV fluorescence from an agarose gel electrophoresis 
of products of nested RT-PCR from organ homogenates of cotton rats. The gel demonstrates 
that all organs of the infected rats were negative for Ad2/CFTR with the exception of the 
5 small bowel; 



Figures 18A and 18B show differential cell analyses of bronchoalveolar lavage 
specimens from control and infected rats. These data demonstrate that none of the rats 
treated with Ad2/CFTR-1 had a change in the total or differential white blood cell count 4, 
10 1 0, and 1 4 days after infection (Figure 1 8 A) and 3, 7, and 1 4 days after infection (Figure 
18B); 

Figure 19 shows hematoxilyn and eosin stained sections of cotton rat tracheas from 
both treated and control rats sacrificed at different time points after infection with 
1 5 Ad2/CFTR- 1 . The sections demonstrate that there were no observable differences between 
the treated and control rats; 



Figures 20A and 20B show examples of UV fluorescence from an agarose gel 
electrophoresis, stained with ethidium bromide, of products of RT-PCR from nasal brushings 
20 of Rhesus monkeys after application of Ad2/CFTR-1 or Ad2/p-Gal; 



Figure 21 shows lights microscopy and immunocytochemistry from monkey nasal 
brushings. The microscopy revealed that there was a positive reaction when nasal epithelial 
cells from monkeys exposed to Ad2/CFTR-1 were stained with antibodies to CFTR; 

25 

Figure 22 shows immunocytochemistry of monkey nasal turbinate biopsies. This 
microscopy reveals increased immunofluorescence at the apical membrane of the surface 
epithelium from biopsies obtained from monkeys treated with Ad2/CFTR-1 over that seen at 
the apical membrane of the surface epithelium from biopsies obtained from control monkeys; 

30 

Figures 23A-23D show serum antibody titers in Rhesus monkeys after three vector 
administrations. These graphs demonstrate that all three monkeys treated with Ad2/€FTR-1 
developed antibodies against adenovirus; 



35 Figure 24 shows hematoxilyn and eosin stained sections from monkey medial 

turbinate biopsies. These sections demonstrate that turbinate biopsy specimens from control 
monkeys could not be differentiated from those from monkeys treated with Ad2/CFTR-1 
when reviewed by an independent pathologist; 
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Figures 25A-25I show photomicrographs of human nasal mucosa immediately before, 
during, and after Ad2/CFTR-1 application. These photomicrographs demonstrate that 
inspection of the nasal mucosa showed mild to moderate erythema, edema, and exudate in 
patients treated with Ad2/CFTR-i (Figures 25A-25C) and in control patients (Figures 25G- 
5 251). These changes were probably due to local anesthesia and vasocontriction because when 
an additional patient was exposed to Ad2/CFTR in a method which did not require the use of 
local anesthesia or vasoconstriction, there were no symptoms and the nasal mucosa appeared 
normal (Figures 25D-25F); 



1 0 Figure 26 shows a photomicrograph of a hematoxilyn and eosin stained biopsy of 

human nasal mucosa obtained from the third patient three days after Ad2/CFTR-1 
administration. This section shows a morphology consistent with CF, i.e., a thickened 
basement membrane* and occasional morphonuclear cells in the submucosa, but no 
abnormalities that could be attributed to the adenovirus vector, 

15 

Figure 27 shows transepithelial voltage (W{) across the nasal epithelium of a normal 
human subject. Amiloride (yM) and terbutaline (|iM) were perfused onto the mucosal 
surface beginning at the times indicated. Under basal conditions (Vt) was electrically 
negative. Perfusion of amiloride onto the mucosal surface inhibited (V^ by blocking apical 
20 Na + channels; 



Figures 28 A and 28B show transepithelial voltage (V^ across the nasal epithelium of 
normal human subjects (Figure 28A) and patients with CF (Figure 28B). Values were 
obtained under basal conditions, during perfusion with amiloride (|iM), and during perfusion 

25 of amiloride plus terbutaline (jiM) onto the mucosal surface. Data are from seven normal 
subjects and nine patients with CF. In patients with CF, (V^ was more electrically negative 
than in normal subjects (Figure 28B). Amiloride inhibited (V^ in CR patients, as it did in 
normal subjects. However, V t failed to hyperpolarize when terbutaline was perfused onto the 
epithelium in the presence of amiloride. Instead, (V{) either did not change or became less 

30 negative, a result very different from that observed in normal subjects; 

Figures 29A and 29B show transepithelial voltage ( Vt) across the nasal epithelium of 
a third patient before (Figure 29 A) and after (Figure 29B) administration of approximately 
25 MOI of Ad2/CFTR-1 . Amiloride and terbutaline were perfused onto the mucosal surface 
35 beginning at the times indicated. Figure 29A shows an example from the third patient before 
treatment Figure 29B shows that in contrast to the response before Ad2/CFTR-1 was 
applied, after virus replication, in the presence of amiloride, terbutaline stimulated V t ; 
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Figures 30A-30F show the time of course changes in transepithelial electrical 
properties before and after administration of Ad2/CFTR-l. Figures 30A and 30B are from 
the first patient who received approximately 1 MOI; Figures 30C and 30D are from the 
second patient who received approximately 3 MOI; and Figures 30E and 30F are from the 
5 third patient who received approximately 25 MOI. Figures 30A, 30C, and 30E show values 
of basal transeptithelial voltage (V{) and Figures 30B, 30D, and 30F show the change in 
transepithelial voltage (AVj) following perfusion of terbutaline in the presence of amiloride. 
Day zero indicates the day of Ad2/CFTR-1 administration. Figures 30A, 30C, and 30E show 
the time course of changes in basal V t for all three patients. The decrease in basal V t 
1 0 suggests that application of Ad2/CFTR-1 corrected the CF electrolyte transport defect in 
nasal epithelium of all three patients. Additional evidence came from an examination of the 
response to terbutaline. Figures 30B, 30D, and 30F show the time course of the response. 
These data indicate that Ad2/CFTR-1 corrected the CF defect in CI" transport; 



1 5 Figure 3 1 shows the time course of changes in transepithelial electrical properties 

before and after administration of saline instead of Ad2/CFTR-1 to CF patients. Day zero 
indicates the time of mock administration. The top graph shows basal transepithelial voltage 
(Vt) find the bottom graph shows the change in transepithelial voltage following perfusion 
with terbutaline in the presence of amiloride (AVt). Closed symbols arc data from two 

20 patients that received local anesthetic/vasoconstriction and placement of the applicator for 
thirty minutes. Open symbol is data from a patient that received local 
anesthetic/vasoconstriction, but not placement of the applicator. Symptomatic changes and 
physical findings were the same as those observed in CF patients treated with a similar 
administration procedure and Ad2/CFTR- 1 ; 

25 

Figure 32 shows a map of the second generation adenovirus based vector, PAV; 

Figure 33 shows the plasmid construction of a second generation adenoviral vector 6 
(Ad E4 ORF6); 

30 

Figure 34 is a schematic of Ad2-ORF6/PGK-CFTR which differs froqi^d2/CFTR in 
that the latter utilized the endogenous Ela promoter, had no poly A addition signal directly 
downstream of CFTR and retained an intact E4 region; 

35 Figure 35 shows short-circuit currents from human CF nasal polyp epithelial cells 

infected with Ad2-ORF6/PGK-CFTR at multiplicities of 0.3, 3, and 50. At the indicated 
times: (1) 10 fiM amiloride, (2) cAMP agonists (10 forskolin and 1 00 fiM IBMX, and (3) 
1 mM diphenylamine-2-carboxylate were added to the mucosal solution; 
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Figures 36A-36D show immunocytocheraistry of nasal brushings by laser scanning 
microscopy of the Rhesus monkey C, before infection (36A) and on 7 days (36B); 24 (36C); 
and 38 (36D) after the first infection with Ad2-ORF6/PGK-CFTR; 

Figures 37A-37D show immunocytochemistry of nasal brushings by laser scanning 
microscopy of Rhesus monkey D, before infection (37A) and on days 7 (37B); 24 (37C); and 
48 (37D) after the first infection with Ad2-ORF6/PGK-CFTR; 

Figures 38A-38D show immunocytochemistry of nasal brushings by laser scanning 
microscopy of the Rhesus monkey E, before infection (38A) and on days 7 (38B); 24 (38C); 
and 48 (38D) after the first infection with Ad2-ORF6/PGK-CFTR; 4 

Figures 39A-39C show summaries of the clinical signs (or lack thereof) of infection 
with Ad2-ORF6/PGK-CFTR; 

Figures 40A-40C shows a summary of blood counts, sedimentation rate, and clinical 
chemistries after infection with Ad2-ORF6/PGK-CFTR for monkeys C, D, and E. There was 
no evidence of a systemic inflammatory response or other abnormalities of the clinical 
chemistries; 

Figure 41 shows summaries of white blood cells counts in monkeys C, D, and E after 
infection with Ad2-ORF6/PGK-CFTR. These date indictate that the administration of Ad2- 
ORF6/PGK-CFTR caused no change in the distribution and number of inflammatory cells at 
any of die time points following viral administration; 

Figure 42 shows histology of submucosal biopsy performed on Rhesus monkey Con 
day 4 after the second viral instillation of Ad2-ORF6/PGK-CFTR. Hematoxylin and eosiif 
stain revealed no evidence of inflammation or cytopathic changes; 

Figure 43 shows histology of submucosal biopsy performed on Rhesus monkey D on 
day 1 1 after the second viral instillation of Ad2-ORF6/PGK-CFTR. Hematoxylin and eosin 
stain revealed no evidence of inflammation or cytopathic changes; 

Figure 44 shows histology of submucosal biopsy performed on Rhesus monkey E on 
day 18 after the second viral instillation of Ad2-ORF6#GK-CFTR. Hematoxylin and eosin 
stain revealed no evidence of inflammation or cytopathic changes; and 
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Figures 45A-45C show antibody titers to adenovirus prior to and after the first and second 
administrations of Ad2-ORF6/PGK-CFTR. Prior to administration of Ad2-ORF6/PGK- 
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Nucleotide Sequence Analysis (cont) 

e70i irccocxnKx* ooci w rw tccaocaoao oooqcoqcoc ttoooogxac aoaatooooo 

6761 TXGTCGGTCT AOCTO O OlCr OOTCCOOCXW OTCTOOOTCC AOCttTAAAOA OCOOQUCAG 
6621 CMOOOOOCO TOOAAOT&OT CTAICTTOCA TCCTIOCAM TCTAOOOCCT OCTOCCATOC 
6661 OOQOOCOGCA AOOOOQOdCT OJwOTMOQOX CCCCATO3CA VZ&OOIQOQT 

6941 0*000000*0 OCOEACATOC OQCAAATOTC CTAAAOCTAO A0Q9GCTC1C TCAGTATTCC 
9001 AAOATATCTA OOOCMCASC TTCCACOQOQ QXTOCTOOOO OQCAGOPAAT CGXATA0RC 
90*1 OTOOCUOOOA OCOAOOAOOT OOOttCOOM OXTOCTACOO OCQQQCTOCT CTOCTOQOAA 
9121 OACTATCTOC CTOAAOATOQ CATOTOAOTT OOATOAXATO OTTOOACOCT OOAAO A OQTT 
9161 OAAOCWQCO ICTOTOAOAC CTACOQCGit AOGCACOAAfl QHXKXttfiOQ AOIQKOCA3 
9241 CTTOTO3ACC AOCTCOOCOa TCACCWCAC OTCTMOOOC CACTACTCCA OOGTTTOCTT 
9301 OATGATCTCA TACTTATCCT O TO OC T TTCT TTTOCACAOC TOOCGOTTOA OGtACAAACTC 
9361 TTCOCOOTCT TTCCAOTACT CTTOOASCOQ AAAOOOOTOO OCCtCOGAAC OOTAAOA0CC ' 
9421 TAGCATOTJU3 AACTOOTTOA OOOCCTOOTA O0WCAOCAT OOCTTTTCTA QQQOT AQCQC 
9461 OTATOCCTOC OOOOCCTTCC OOA0OOA0OT OTOQOTOA0C OCAAAGGTOT COCTAAOCAT 
9541 OACTTTfcMG TACTOOTACT TOAAOTCAGT QTOQTOGCXT CC OCCC TOCT CCCAOXOCAA 
9601 AAAOTCCOTG OOCTTTTTOO AACQOOQQTT TOOCMOOOOO AAOOTOACAT COTTCAAAAG 
9661 TJ CT C CT IOCC OCOCGAOQCA TAAAOTTOCO TOTOATOCOQ AA000TOC03 OCACCTOMA 
9721 ACOQTTGTTA ATTACCTOOa COQCOADCAC CAJCTOOTOQ AAOOOTTOA OTTOI OQCC 
9761 CACOATOIAA lOCTGGAAOA AOOQOQQGGT GCOCTTCATO OXMQCAATT TTTTAAQTTC 
9641 CTCGTAGOTO AOCTCCTCAO OQOAGCTOAO COCCTOTTCT OXCAOOOCCC AO1CT 0 CAAQ 
9901 ATWOOOTTO OAAOOOAOOA A3OA0C1CCA CAOQT CAOQQ OOCAXTAOCA TOOCMQTO 
9961 O1COCOAAA0 OTCCTAAACT OQOOAOCTAT OOCCATTTTT TCTOGOGTOA TOCMTO OAA 
.10021 QMkhSCQQO f CTTQTT OOC AflOOtlTOOCA TCCAAWtCC ACMCTMOT C9O00Q0OOC 
10061 OOICACCAOA OOCTCATCTC OOOOOAACTT CAJPAACCAOC ASX1AA0OOCA CGAOCTOCTT 
10141 CCCAAAOQCC CCCATOCAAO TATAOOICTC TACATO0TM OXOACAAAOA OAOOCTOOOT 
10201 OOOAflOATOC OAOCOQA3CO OQAAOAACT0 OATCTCCCQC CACCAOTTOG AOCAOTQQCT 

10261 arrokwmo tuaaaotaoa aoiccctoco acoo xc q aa cactoctxt oocTmarx 

10321 AAAACOTOOO CAOTACTOQC A0OQGTQCAC OOOCtOXACA T0CTOCAOOA OOTTUACCTO 
10361 AOOACOOOCIC ACAA0OAA0C AOAOXOOOAA TTTOAOOC0C TOOOMQQCO QOTT TOOgTO 
10441 GTQQTCTTCT ACTTOQOCTO CTTOTOCTTO ACOCTCTWC TOCTOQAOM QA QTTATO QT 
10501 OOASCOOACC ACCACOCCOC OCCA flCCC AA AOTCCAflATO TOCOCQOOOO OCOOTCOOAG 
10561 CITOXTOACA ACA1COCOCA OATOMfcJCT QTCCATOCTC TOCACCTCCC COQQCttATAf3 

10621 oicAoocoaa aoctcctoca ootttaoctc qcataocoqg otcaoooooc ocqctaqctc 

10661 CAO5T0ATAC CTOA3TTOCA OOOOCTOCnT O0TOOOMCO TOGATOACTT OCAAOAOOCC 
10741 OCATCCCOGC OCCOCGACTA COOTACOQOO OCXXXSOO0QG T QQ00000QQ QO flTO T Ct T T 
10601 QOATCATOCA TCTAAAAOCO OIGAOCOQOC C O OQO C COnQ aAOOTAOOOO OOOCTOMOA 
10661 CCCOCCOOaX OAOOOOOCAO OOOCAOCTOC GOOOOO00CX3 COOOCAOOAO CTOOTOCTOC 
10921 OGOCOOAOOT TOCTOOCCAA COCGAOCACO COOOOOCTOA TCT OCTOAA T CTOOOQCCIC 
10961 TCXCTQAAOA COAOQQQCOC OCTtOMCTTO aXCCTGAAAC AGAOROQAC AQAA3CAATT 
11041 TtOUtOTC&T TOAOOOCOQC CTOOOOCAAA ATCTCCTOCA CMCTOCTOA WltftX-llUA 
11101 TMOCOATTT COSCCATCAA CTQCTOQATC 1CTTCCTDC T OQAfiATCTOC QCQTOCQGCT 
11161 COCTCCACOO TOOCXWOCAO CTQOTTCWAfl ATOCOOOCCA TCAOCTQOCA CAAOCJOOTTC 
11221 AOOCCTCCCT CCTTCCA3AC QOQOCTOT A S ACCACGCCCC CTTCQOCATC OOQOOCOOGC 
11281 ATOACCACCT OCOCOAOATT CAOCTCCACQ TOCCOOOCOA ACAOOGCCTA OTTTCOCAOC 
11341 OOCTOAAAOA OC3TAOTIOAG OCTOOTflOCO OlUmTTCTC CCAOGAAGAA CTACATAACC 
11401 CAGOOTCC5CA ACOTOOATTC OTTOATATCC OOCAAQOCCT CAAOOCOCTC CATOQCCTCQ 
* 11461 TAOAAOTCCA OOOCGAACTT CAAAAACTCO OAflTTOCOCa CCOACACOOT TAACTCCTCC 
11S21 TCCAflAAOAC OOATOAOCTC OOCOACAOM TOOCC3CACCT COCOCTCAAA QCCTACAOOG 

11581 occTcnnrr cttcaatctc ctcttcxata acqccctccc crrcrrcrrc tvcttctcgc 
H641 Qocxxrroaaa oaooooooac aooocxxkoa oqaojcxxjca ccooawoco otcoacaaac 

11701 COCTCOATCA TCTCCCCQCO OCX3AOOOC30C ATQOTCTCOG TG AOQQCQCC CCCCTTCTOG 
11761 CCOOOCCCCA GTTOGAAQAC OCXOCCCOTC ATCT0COQCT TATWWTTOG COOOGGGCTG 
11821 CCGTOOOCCA <X3<MlTACOOC OCTAAOCATC CATC1CAACA ATTOTTCTCT AflCTACTCOG 
11661 CCACCCAGOG ACCTOAOOOA <«COCCATCC ACCOGATOCaO AAAACCTCTC GAOAAAOOCG 
11941 TCTAACCACT CACACTCGCA AOCTAOOCTO AOCAOOOTOG OOOCXXJOCAC COOCJIOCXXX; 
12001 TCOOQCTTCT nCTOOCOaA QgT Q CTCCTS XTCATOTAAT TAAACTAOOC GGTCTTOAOA 
12061 COCCGCATOG TCGACAGAAC CACCATOTCC TTOC5GTCCGC CCTOCTCAAT GCGCAOOCGC 
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Nucleotide Sequence Analysis (conL) 

12121 TOCGCCATOC CCCMOCTTC OTTTTXJACAT COGCOCA0GT CTTTCTAQTA OXCTTOCATC 
12161 A0OCTTTCTA CCGGCACTTC 1TCTTCTOCT TOCXCTTOXC CTOCA3CICT lOCASCEASC 
12241 GCTACGOCOG OOQCOGAOTT TGOCOGTAOO TQUOGCCCTC YTOCTOCCAT OOGTOIUACC 
12301 CCGAAGCCCC 1CATCOOCTO AAOCAOOOCC A0OTOGOOGA CAAOQ OQCTC OOCTAAXATO 
12361 QCCltXTOCA CCTGCOTOAO OOTAOACIOa AAGTCA2TCCA TOTCCACAAA Q030TQQTAT 
12421 OQ3 CC CX7T9T TaATOOTOTA ACT9CAOTTO OCCATAAOGG ACCAOTTAAC OGTCTQGTOA 
12481 CCOG GCT QOO AGAGCTCOGtf GTACCTGACA OOOGAGTAAO COCTTOA0TC AAAOAOQTOO 
12541 TCQTTGCAAfl TOOOCACCA& OTACTGATAT CCCACCAAAA AOTOCOOCOC 09GCTOG0GG 
12601 TAGAOOOOCC OOCCOOOOCT CCQGOGGCOA OOTMTCCAA CATAAGOOGA 

12661 TOATATOOQT AGAXUTACCT OOACATOCAG 0TGATOCCO3 OOOOOCmwrr QQAQGOQOGC 
12721 OGAAAOICCC OOAOOCOOTT OCftOAlGTTQ OOCACCOCJCA AAAAOTOCTC CATOGTOQQG 
12761 ACOC1CMOC COOTGAGGCG TOCOCACTCG TTCACGCTCT AOACCOTOCA AAAGGAOAGC 
12841 CTOTAAGOGG OCACXCTTCC QTOOTCTOQT OGATAAATTC GCAAQOOTAT CATOQQQQAC 

12901 tsACCGGGarr cgaaccccog atoomccot coqccotgat ccArooocrrr accqcccoco 

12861 TOTCGAACCC AOOTOTGOGA CGTCAGAGAA OQQQQQXOOQ CT OCOTW OP C TTOCT T CC A 
13021 OGCOOGGCGG CTOCTOOGCT A0CTTTTTTC GOCACTGGCC 0000300000 TAAQO QOTTA 
13081 OGMOGAAAO OOAAAOCAOT AAOTOOCTOO CTOOCTOTAO OOOGAGOOTT ATTCTCCAAC 
13141 CCTTCAGTOG CAGOAOCCCC OGTTOflAOTC 1CQQQCOQQC OOQACTOOOO OOAACQOOQG 
13201 TTTQ C CTOCC COTCAT9CAA QACOCOOCTT OCAAATICCT COMAAACAO OGAOGA0GCC 
13261 CTTCTTTQCr TRCCCACAT ©CATOCOOTQ CTOOOOCAGA TOCOCCCCCC TOCTCA0CAG 
13321 CGOCAAGAOC JUWOAQCAQCQ OCAOACATCC AOOOCACCCT COOCTICTCC TACOQOQTCA 
13381 GGAQOOGCAA CAXCCOOOOC TGAOOOOOOO GCAGATOOTO ATTAOOAACC GCCOCOOOOC 
13441 iOOOQCOGGOC ACTACCTOGA CTTOGAOGA3 GOOOAGOOOC TOGOOCOOCT AOGA3COCCC 
13501 ECTCCIQAOC OACACCCAAG OOWOOCT3 AAOCOTOACA CQOQO QAQQC OTAOOTOCCG 
13561 OQQCAQAACC TOTTTCGOGA OOQOQAOQQA QAQQAOOOOQ AQ QAflATQOQ OGATCGAAAC 
13621 TTCCAOOCAO OOCOCOA0TT GOOOCAPOOC CTOAACOOOO AO030TT0CT OCOCOAOQ A g 
13681 OACTnOiOC QOGAOQOGOG OAOCOOGATT AOTOOOOOOC OCOCACAOGT OQCQO COOOC ' 
13741 GACCTOGTAA CCOCOTAOGA OCAOAOOOTG AAGCA0GAO3 TTAACTTTCA AAAAA OCTTT 
13801 AACAA0CAOG TOOOCACCCT TOMOOOCOC 0AOG3OCTGO CTATAOGACT QATOCATCTO 
13861 fOOGACTXTO TAAOCOCOCT OQAQCAAAAC CCAAATAOCA AQC OQCTeAg OOCOCAOCTO 
13821 TTCCTTATAO TOCAOCACAB GAOOGACAAC OAOOGATTCA OQQATO COCT OCT AAACATA 
13861 OTAOACXXCG A00OOOQCTO OCTOCTO Q AT TTOATAAACA TTCT3CAOA0 CATAOTOOTQ 
14041 CAOGAOCOCA GCTXCAOCCT OGCTGACAAC OTOOCOOCCA TTAACTATTC CATOCTCACT 
W101 CTOOGCAAGT TXTAOGOCOG CAAOATATAC CATOOCOCTT AOCTTCCCAT AOACAMQAC 
14161 OTAAAGATCG A9GGGTTCTA CATOOOCATG OOGTTCAAOG TOCTTACCTT GAQOQAQQAC 
14221 CtOOOOOTTT ATOGCAAOGA GC5GCATCCAC AAOGCOGTGA OCOTOAOCOG GOGOOOOMC 
14231 CTCAGCQACC GCGAGCTQAT OCACAOCCTC CAAAOOOCCC TOGCTOGCAC OGOCAGOGOC 
14341 GATASAGAGG CCOAGTCCEA CTTTOADGCG GGOGCTGAOC TOCOCTOOGC COCAAOOCOA 
14401 COOOCCCIOG AOOCAOCICC OOCXXWACCT OOOCTGOOOG TGGCACCCOC CQGOQCTOQC 
14461 AACGTOGGCC GCGTGGA9GA ATATGACGAG GACOATOACT AOGAGCCAOA QM O00 OO8Q 
14521 TACTAAOOGG TCATOTTTCT OAICAGATGA TOCAA3ACOC AACQQACOOQ OOGGTOOGGG 
14561 OGGOQCTOCA CAOCCAOCOO tCOGCOCTTA ACTCCAOGGA OGACT0O0GC CAGGTCATGG 
14641 ACCOCATCAT OTOQCT Q ACT OOGOGTAACC CTGAOGCOTT CCGOCA0CAG OOOCAOOCCA 
14701 ACCOOC1CTC OGCAATTCTO OAAOOOOTOG TCOCOGCSOOG CGCAAACCCC ACQCAOGAQA 
14761 AOOTOCTQGC GATOOTAAAC OOCCTGOOCG AAAACAGGGC CATCCOGCCC GATGAQQCCG 
14821 GCC2«SICTA OGAOGOOCTO CTPCAOCOCG lOGCTCCTTA CAACAOCOGC AAOCTOCAflA 
14881 dcAACXntWA COGOCTOOTO OGOGATCTCC GOGAGC5CCGT GGOGCAGCGT GAOCOCGOCC 
14941 AGCM3CA90G CAACCTOOOC TCCATOGTXC CACTAAAOGC CTJCCTCAOT ACACADCCCG 
15001 CCAACGTOCC GCOOGGACAO OAOGACTACA OCAACTTTOT QAQOGC ACTQ COOCTAATOG 
15061 TCACTCAOAC ACOGCAAAGT GAOGTGTACC AGTCCGOGCC AGACTATTTT TTCCAGACCA 
1S121 GTaGACAAGG CCTOCAOACC OTAAACCTCA OCXMSOCTTV CAAGAACTTO CAOOOOCOTT 

15181 GGGGGGTOCG ggctcccaca ogcgacogog ogaccgtotc taocttoctc adocccaact , 

15241 COCGCCTOTT GCTOCTCCTA ATAGCGCCCT TCACGGACAG TGOCAOCGTC TCCCOGGACA 
15301 CATACCTAGG TCACTTOCTO ACACTGTACC OCGAGGCCAT AOGTCAOGCO CATCTGOACO 
15361 AOCATACTTT CCAGGAGATT ACAAGTGTCA OCCQCOCOCT OOGCCAGGAG GACAOGGOCA . '« 
15421 GC CTGGAGGC AACCCTCAAC TACCTGCTOA CCAACCQGCG GCACAAGATC CCCTOCTTCC 
15481 ACACTTTAAA CAGCGAGGAG GAOCOCATCT TCCGCTATCT GCAOCAOAGC GTGAGCCTTA- 
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Nucleotide Sequence Analysis (cant.) 

1SS41 ACCTQATOOO CGACGGGG1A ACGCCCAGOG TGOOGCTGGA CAltSACCOO© OQ CAAC ATOfl 
15601 AACOQOOCAT G7ATGCCTCA AACC00CCOT TTATCAATCG CCTAATCCAC TACTIOCA3C 
1S661 OCOOOOCCOC COTOAACCCC GAOTATTTCA CCAATOCCAT CTTOAACOOG CACTOQCTAC 
1S721 OGCCCCCXGG TTTCTACACC OOOOGATnO AGGTOOCCOA OO OTAXOOy O ggTCCT CT 
15781 OOGACOACAT AGACGACAQC tf H««CC C CGCAAOCOCA CAOCCTGCTA CACTTGCAAC 
15641 AGCGOGAGCA OGCAGAOGOG OCGCTGOGAA AGGAAAGCTT OOOCAQQOCA AOCAOCTTOT 
1S801 CCOATCTAGO COCTOCOOOC CCQCOOICAG ATCOGAGTAC OCCXTTTOCA ACCTTGATAO 
15961 OOTCTTTTAC CAOCACTOOC ACCACCCOCC OOCGCCTOCT QOGOGAOGAO GAGTACCTAA 
16021 ACAACroqCT OCTOCAOCCO CAOCOCGAAA AGAACCTGCC TCOOQCAOTT CCCAACAAOO 
16081 GGATAGAGAO CCTAOTaOAC AAOATOAGTA GATGGAAGAC OTA TOOQC AO QAQCACAGQG 
16141 ATOTOCOCOQ CCCQCGCCCC CCCACCCOTC OTCAAAQOCA CGACCQTCAG COOGGTCTOG 
16201 TGTOOGAOGA CGATCACTCC OCAOAOOACA OCAGOOTCCT QOATTTQOGA OGGAGTOOCA 
16261 ACCCGTHC C GCACCTTCOC CCCAOGCXOG OGAGAATGTT OTAAAAAAAA AAAAAAAAAC 
16121 CATOATOCAA AATAAAAAAC TCACCAAOOC CATOGCACOG AOOOTTQOTT TTCTTOTATT 
163*1 CCCCTTAGTA TCCAOOOOOC OQCGATQTAT GAGOAAGOTC CTOCTCCCTC CTACCAGAGC 
16441 SroSSoOO OtaOOOCCAOT GGCOOOOGCO CIOQOCTOCC CCTTOGATQC TCOCCTO3AC 

16S01 cco cc om o wcciccgcg otaccxocog cctaccoojo ggagaaacag catocotoc 

16561 1CTOAGTTGG CACCCCTATT OOACACCACC CCTBTOTAOC TTOT QOACAA CAAGTCAAOG 
16621 CATOTOQCAT CCCTOAACTA CCACAAOGAC CACAOCAACT TTCTAAOCAC OG TCAJTCAA 
14681 AACAATCACT ACAGCCCOOO GGAOOCAAOC ACACAOACCA TCAATCTOGA OGACOOTTOG 
16741 CACTOGOOOO OCOACCWAA AACCASCCT9 CASACCAACA TO0CAAA3OT GAAOOAOtTC 
16801 AIOTTTAOCA AXAAGTTTAA 000X303070 ATOOTOTOOC OCTOQCTTAC TAAOOACAAA 
16861 CAGOTOOAGC TOAAATATCA QTOQOTOGAO TTCACOCTOC C OCA O QQ C A A CTACTOCGAG 
16821 ACCAXOACCA TAOACCTTXT GAACAACOCO ATOOTOOAOC ACTACTTOAA AQTOOOCAOO 
16881 CAOAACOPCC TTCTOOAAAO COACA1CO00 OTAAAOTTTO ACAC C OOCA A CT TCAGACT Q 
17041 GOOTTTSACC CAOTCACTOG TCTTOTCATC CCTOQOQTAT A T ACAAAOQA AGCCT TCCAT 
17101 CCAGACAfCA TTTTOCT OC C AGGATOCGOG GTOGACTTCA OOCACAOCOO CCTOAOCAAC 
17161 TICTTQOCCA TCCGCAAOOO OCAACCCTTC CAOGAOOOCT TX8O0A3CAC C TACOAT OAC 
17221 CTOGAOGOTO OTAACATTCC OGCACXOTTG CA tOTQCAO O OCEAOC A OQ O AAOCOTAAAA 
17261 -GATGACAGOG AACAGOOOOO GGAXOOCGCA QQOOOCGGCA ACAACAOTOO CAOCGOCOOG 
17341 GAAGAGAACT CCAAOOOOOC AGCCOOOOCA ATGCACCOGC TOQAOOACAT QAACOA TCAT 
17401 OCCATTCGCG OOGACAOCTT TOOCACACOO GOGGAOGAGA AOOQOOCTOA OOCCOAQOCA 
17461 OGGOCAGAAO CIOCCGCCCC COCTOCGCAA OCCOAOGTOO AGAAOCCTCA GAAGAAAOOG 
17S21 GTOATCAAAC CCCIGACAOA GGACAOCAAG AAACflCAOTT ACA ACCTAAT AAflCAA TQAC 
17581 AGCACCTTCA CCCAOTACCO CAOCTG3TAC CTTOCATACA ACTACGGOGA OCCTCAGACC 
17641 GOGATCCGCT CATOOACCCT CCTTTOCACT CCTOACOTAA OCTOOQOCIC OGAOCAOCTC 
17701 TACTQGTCGT TGCCAOACAT OATQCAAGAC OCOOTOACCT TOOOCTCCAC CAOOCA CATC 
17761 AOCAACTTIC Ct^TGQTOQO O3CCOA0CTC TWMXCQTOC ACTCCAAOAC CWTACAAC 
17821 OACCAOOCCG TCTACPCOCA OCTCATCOOC CACTTTACCT CTCTOACCCA OOTOTTCAAT 
17881 CUCm^OCU AGAACCACAT 1TTOQ0QCOC OCGCCAGCOC CCACCA TCAC CACOGTCACT 
17941 CAAAA031TC CTGCJCTCAC MATCAOGGG ACGCTACCOC ro O QCA AC AO CATOGGAGGA 
18001 GTOCAOOGAG TGACCATTAC TCAOOOCAOA COCCOCAOCT OC^CTACOT JThCMPOX 
18061 C1WXJCATAO TCTOGCCOCG OOTCCTATCG AGOCOCACTT TTTGAGCAAA CATSTOCATC 
18121 ' CTTATATOOC CCAGCAATAA CACAGOCTGO QGCCTGOQCT ^COU«CAA GKWOTMC 
18181 OGOOCAAAGA AOCOCTCOOA OCAACACOCA CTOCOCCTGC OOOOOCACTA CCOOGCGCCC 
18241 TCOOOOOOGC ACA>»CG«K CCOCACXOGO COCAOCACCO TOOATGAOCC OOTGAOGCG 
18301 CTGCTOCaOO AGOCGCGCAA CTACACGCOC ACOOCCCCAC CAOIQTCCAC ACTGGAGGOG 
18361 GC^ATTCAGA COOTQGTGOG CGOAGCCCGO CGTTATGCTA AAATGAAGAG AOGGGOGAGG 
18421 COOCTAOCAC GTOGCCACCG CCOCCGACOC GGCACTOCCG COGAAOOCOC OOCQQCQGCC 
18481 CTGCTTAACC OCOCAOGTOG CAOCOOCOGA OOOGOOOCCA TGCOOOCOGC TCGAAOGCTG 
10541 GCCGCOGOXA TTOTCACTOT OCCCOOCAGG TCCAGGCGAC OAGOOGCCOC OOCAGCAGCC 
18601 OCGGCCMTA GTGCTATGAC TCAGGOTOGC AGQGGCAAOC ^ACTGGGT f«CCCACTCG 
10661 GTTAGCGGCC TGiXXXHOCC CGTGCGCACC <«XCCCCOC OCAJ^TAGAT 
18721 AACTACrrAC ACTCGTACTO TTOTATfiTAT CCAOOTOCOG OMOOOOOA CGAAGCTATG 
10781 TCCAAGOGCA AAATCAAAGA AGAGATOCTC CAGCTCATOO COCOOGACAT CTATGGCCCC 
10841 cccaagaaIg aaoaSaoga ttacaagccc CGAAACCTAA AOCOOCTCAA AAAGAAAAAG 
18901 AAAGATCATO ATOATSATCA ACTTCAOOAC GAOGTGGAAC TOCTGCACGC AACCOCOCCC. 
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10961 M9COOOOM TACAOTOOAA AflOTCCAOOC ClAASACCTO «™MAOC O OTC^ CC *C C 

19021 OTjarrrre* ccocooctca ccjoctocacc oocacctaca mwomwa tomwjooto 

19081 TACOfiCCAOO AOOACCIGCI' 1CMCAOOCC AACQAQCOCC 1C0OOOAOTT TBCCTACOOA, 
19141 AA9C00CATA AOOACAIOIT O000XTOC08 CtOOACOAQ© OCAAOCCAAC ACCTAQOCTA. 

Sioi AMCCoSax cactocaoca oarocroocc amcttocac asicor^AOA axaooooooc 
i92«i ctaaaccoco *ororoox<» cxtoqcaooc nxotccaoc taaoaoACC caaococcao 

19321 oSciSaAAO AOTTCTTOOA AAAAA1GACC OTOOAOCCTQ OQCTOGAGCC COAOCTOOOC 
19381 OTOOOQOCAA TCAAOCAOOT OOCACCOOflX CTQOOOOJOC AGACCOTMA COTTCAOATA 
19441 CCCACCACCA OXAGCACTAO TATTGCCACT OCCACAOAOQ GCATQOACAC ACAAAOOXCC 
19501 COOOTTOCCT COOCOOOWJC AOATOCOOCO OTOCAOOCOO CCOCTOOCOC OOOQTOCAAA 
18561 ACCTCTACQO AGOTQCAAAC OGACCOOTOO AlOTTTCOGO TTTCAGCOOC CC^OCCCO 

19621 cQccorr«e* ogaaotacoo caccoocaoc ocactactoc ccxjaaiaioc cctacaxcct 

19601 TCCATCQCQC CTACCCCCOO CTATCCTOGC TACACCTACC OCCCCAfflUkJ ACOAOCCACT 
19741 ACCXXUkOOOC GAACCACCAC T0C3AA0C0GC COOOOCOOTC OCCOTOOCCA OCCCOTOCTC 
19801 OCCCCOATTT CCQTOCOCAO OOTO0CTCCC CAACCAQOCA Oa AOOCTaQT «CTOCCAACA 
19*61 COOOOCTACC ACCCCAGCAT COTRAAAAG OCQCTCTtTB TOCWTOTOC AOA^TOOOC 
19921 CTCACCTOCC OC C1CCO CTT CCCOOTOCCC OOATTOOQJO OAAOAATOCA OOCIMOAOS 
19901 OQCAOTOCOO OCCAOQOCCT OACOOOTOftr ATI3COTOOTO OOCACOCOO GOaOOOOOCC 
20041 OCOTCOCACC (STtMCAMCXI COOCOOXATC CTOCJOCCTCC TTATTCCACT OAMQCOQCO 
20101 OCOATXOOOO 0C&T0CCO0 O AAtTOCATOC OWOCC T T Q C AOOCOCACAC ACAC^RPl 
20161 AAAACAAOTT OCATOTOGAA AAA1CAAAAT AAAAAOTCTO OAOTCTCATO CTOOCTOOCT 
20221 CCTOTAACTA TTTTOTAOAA TQGAAOACAT CAACTTTOC© tCTCTOOOCC «*^<™ 
20201 CIOOCOCCXXJ TTCASOOOAA ACTOOCAAOA TATOOOCAOC AOCAATATOA OMOTOOOOC 
20341 CtTCAOCMO OOCTCOCTOT OOAOCOQCAT TAAAAMTITC OOTTC^ACCA "AAOAACT» 
20401 TQGCABCAA0 OCCTOQAACA CCAOCACAOO CCMMXW *«WACAAOT 
20461 AAATTTCCAA CAAAAQOTOQ TAOATOOCCT <^CV^OC SSSSS 
maaACCAa OCAOIOCAAA ATAAOAXTAA CAOIAAOCtT OATOCOOOOC CT O COO TOtt A 
aSSl 2LSc«CCA OOOOOOOTOO AOACAOTOTC 1CCAOAOOQO COTOOCCAAA A0OOTCCOO9 
SIS £££££ SaaISS? WXCAOOCA AA^TOAC CCTOOMOM A^AOOAOOC 
20701 ACTAAAflCAA 03CCTOCCCA CCAOCOOTCC CAtCQOOCCC £2SS£? 
20761 CCAOCACACA CCTCTAAOOC TQQACCTOCC WOC«MCT «*CAOCCAOC JOAAAOCTOT 
20021 OCTOCCAOOO COQICOOCOQ TTOTTOCAAC ~?OCCTWC JJOOCOTOOC 
20081 COCCAOCOGT COOCOATOOA TOOMWWT 

20*41 CAOCATOOra OOTCTOGOOG TQ&XtCCCt GXkGCQCCGA OMWCYIVJ. amia^im 

S£S£SS SSSS^ io«mxAto SSSS'SSSSS SSSSS 

21061 OCOCCCOCTO TCCAAOATOO CTACCCCTTC OATOATO00O CAflTOOTCTT ACATOACAT 
21121 CtaWSCCAO OAG0CCIC08 ACTACCIOAO CCCCO0OCTC SStSS 
cnkOACfiTAC TTCAOOCTOA ATAACAACTT TAflAAAOCCC ACCCTOOCAC CTAOQCAOCA 
Hill SSSSacI OAcSSSc MCOmOAC OCTOOQOTTC AICCCTOTOO ACCOCCAOQA 

21361 TAlOaCTTCC ACOTACTTTO ACATCCQC03 COTOCTMAC AOOOaoOCTA «TTTAAOCC 
21431 CTACTCCGGC ACTOCCTACA ACCCTCTAGC TOCCAAOOOC OCTCCTAACT CCTOTQACTC 
Sm SSSaaS SaSSS CCOCWOCAOT TOCCOA02AT ^J^OJ ^TCAAOA 
21S41 TQAAGAAOAa OAAOAAGAAC AOCAAAACCC TOOAOATCAO OCTACTAAflA AAACACATOT 
lioJi SSSccac Stcottot ctogacaaac AATTACAAAA ACCCOOCTAC AAATAOOATC 
Se" SSJSS GAAACACAAO CTAAACCTGT AriboCAflAT CCTTCCTATC AACCAOAACC, 
21721 TCAAATTCOC GAATCTCAOT OOAACGAAOC «A«MCTAAT OCOOCAOttAC GCaSACTPSCT 

^78^ ^aaI^aISS actcccatqa aaccatcc^a ™oatcttat ^JOO^ ^atccttt 

il«41 TOCnxXSTCAA TCOGTTCTQQ TTCCOOXTO* AAkKXSQOTQ CCTCTTOCAA AOOTTCACTy 

jsssSa ccit™^ sjss^ 

«Jf i S^Sa^S iSS^ SSSSES 

SSw ^SSJJJ SSS SSSS TAnOCCCTA ATGTATTATA ACAOCACTOG 
SSaTCOC? OTTcSoCTO CTCA60CATC OCACCTAAAT CCOOTOOTAO ATTTOCAAGA 

III*] ca^SSS gaISSS? mJaactctt ccttoactcc atagctcata gaaccacata 

If;?? JSJSaU xqgxatCAGC CTCTAGACAC CTATCATCCA GATCTTAGAA tCATICAAAA 
22261 TTTTTCTATG TOQAAT^w- . TTGTTTTCCT CTTOOOOOTA TTOOOCTAAC 

22321 CCATOCAACT GACGATCAAT TCCCAAATTA 
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.22381 TCACACC1AT CAA0CTA1TA AOGCTAAWG CAAIG0C1CX oocgatx^o ataxugnc 
22441 A3XWACAAAA OXTOXXXCTT TOJCAACACC TAA1CAAATA flMOTOOO^ 
22S01 CAIOOAAATT AACCTAAATO CCAACCTATO Q^OXAATCTC CTWACTOCA MMTOOOCT 
22S«1 CTACCTOCCA OACAACCTAA AATACAACCC CACCXATOTO «MM*«W SSSSSJ 
22621 CACCTACQAC TACATCAACA AGCCACTOCT OOCTCCCOOO CTTO^GACT ^^JTTAA 
23681 CCTTOOOQOG OOCTOQICTC TOOACTACAT OOACAAOOTIP AjU£««™ JOCAOCAOOO 
23741 CAATGCOOOC CTCCX3TTATC OCTCCATOTT OTTOOOAAAC OOOOOCTACO TOOCCTTXCA 

22801 cattcaqqto ccccaaaaot ttittoccat taaaaaoctc ctcctcctoc caooctcata 

22861 TACATAtSa TGGAACTTCA OOAAOCATOT TAACATtWTT CTQCHGAGCT CTCTOOOAAA 
22921 CQATCtSoA GTTOAOQCCC CEAOCATTAA OTTTOACAOC ATHOTCTTT ACOCCACCTT 

2^2 S££££ SSSSS ooocctxxac acr^^cc a^ctcaoaa atoacaocaa 

23041 COACCAOTCC TTTAATCACT ACCTTTCCOC C^CAACATO CTXTACOCCA TAOgCOCAA 
23101 CQCCXCOAC OTOCCCATCT CCATOCCATC OCOCAACTOa OCAOCATTTC OOQCrrOOOC 

SJtt SSSScoc ttqaaSSa aooaaacocc ttccctocma tcaooctacq ACCCTTACTA 

moot atwymanr xrakCCTTlO ACTCTTCTOT raOCTOGOOG 00CAAOOACC OCCTOCTXAC 
SUSSES A*™* £3S2 Oj™, ^CTCAOTJ 

SIS. 2£?3£ ii£5£S SSSES iSSSSS SSSJSS 

SiS otoaccatac taaatacaao oaotatcaoc ao^aat 

Hill ASSISE CATAACAACT CAQGATTOOT AOOCT^CC «*CCCACCA W^QAOOO 
23641 ACAOOCTIAC CCCOCCAAOfl T00CCTAO0C ACTAATA QQC AAAACCOCOQ ^OACAOTAT 

23881 OTITOAAOTC TrtCACOTOO TCOCTOTOCA OCAOCOOCAC OTOJOTXCA TC^COTT 
23941 OTACCIOOdC ACOCCCTTCT OOOCCMCAA °°CCAO*CA mU^C ^SCAA^C 
24001 AACAACACCT OCOOC CATOO OCTCCACTCA OCAOOAACTW JJJOCCAWO 
»*o«i •mrmnoa CCATATMTT TOOQCAOCTA TQACAAOOOC WTCCMWCT TXOTXTCTUC 

m!S Smw Sotcaatac ooccogtooc qaoactocjoo oootacacto 

U"i SSSJSS SSctcaaa aacatomac ctctxtoaoc cctttooctt 

24301 CQCCAMOCT TCTTCCCCCO ACCOCTOTAT AAOOCTOOAA £^ACCC 

24361 OOOOOOCAAC TOOOCCOCCT CIOOAOTOT CTOCTOCATO TTTCJ^^ JSSSaA 

CTO^CTOCTT AACAOTOTCC AQCTACAOCC CACCCTOMT OGCAADCAOQ AACAOCTCTA 

34601 CACTICTTTT TX7TCACTOA AAAACATOTA AAAATAATOT ACraeCAOAC ggggg* 

24661 AAOOCAAA-K TTTTTATTTC ^ACTCTM £££££ 

24721 lUXAXmTt AAAAATCAAA OOOOTTCTCC COCOCATOOC JSSSSS^ 

24781 ACOTTOOOAT ACTOOTCTTT SootSo CAOCToSgC 

24961 OaOrroCAGC ACTOCSAACAC TATCAOOOCC «0OTOOIOCA "CTOOWAC CAOOCTCTTG 

,„„ - V¥M1/11 ~. > cATcocsanc caogtcctcc ocottcctca oooocaacoo agtcaacttt 
Hill SSSS^ SSSaa oootocatoc ccaoocttto aowocactc ocacootact 

2S 50 i S^SSi SSa^S iss^S SSSSac 

25201 Z^^T SSSSS SSSSS SSSS^ 

SS. SSSSJi SSSSS SS^SS SSSncA coawtiox: c^ctaoac 

25321 TTCOAOATCT «^*trJX^t £m«moO CTCGTCACAT CCATTTCAAT CAO0TOCTCC 
?2S ^SSJS SSSS SSSSc CTTOOATCtC AOCOCACCGG 

?SS MISSIS IJSSSc StSgctcg togtgcttot aqottacctc tccaaaccac 

25501 TXSCAGCCACA ACSOCCAOCC COTa^^ >TCCTCACAA A0OTCTTSTT OC1COTQAAC 
^61 ^AOOJJCJ CCTOCAOGAA ^SS^ TOCATAOOOC COCCACACCT 

""1 CTCAOCT3CA JfCCQCOCTG g^S^ CCCTTTAOAT COTTATCCAC CTCCTACFrC 
25681 TCCACTK5GT CAGCCAGTAO TTC1CCCACO CACACACOAT COGCAGGCTC 

25741 TCCATCAACG CCCCCCCAGC CTCCATCCCC 



WO 94/12649 




PCI7US93/11W7 



Nucleotide Sequence Analysis (cont) 

2S801 AOCOQOTTIX TCACCOTCCT TTCACTTXCC OCTICACTOO ACTCT TO CTT TWCTCTICC 
2S861 OTCCOCXTAC OXGCOCCAC TOOQTOC7TCT ICATTCAflOC OCOOCACCOT OCOCTTACCT 
25921 CCCTTOOCOT OCTTGATTAG CACO0OTO00 TTOCTOAAAC OCACCATXTO TAOCOCCACA 
25981 TCTTCTCTTT CTTCCTCOCT CTCCAOOATC ACCT CTO QGC ATOQOQOQCO tffcOOOCTTC 
26041 OOAOATOMC CCTTCTTTTT CT TT T T CQ AC OCAATOQCCA AATOOOOOGT OOAOOTOOAT 
26101 QQ CO QOQQOC TOOOTOTCJOO OOCXAOOOC OCATC1TOIO AOOAOTCTTC TO OQtOCTD O 
26161 GACTOttAOAC OCCGCCTCAG COOCTTTTTT OQOOQCCC0C 005GAOQCOO CQGOGAOQOC 
26221 OAOQOOGAOO ACAOOTCCTC CATOOTTOOT OCAOOTCCOC OOQCAOOQCO TCCCOGCTOQ 
26281 QQQGTOOrrr COCOCTOCTC CTCTTOCOOA CTOQCCATTT OCT1CICCTA TAOGCAGAAA 
26341 AAOATCATOG AGTCAOTOOA OAAOOMQAC AOCCTAACCO OOOCCTTIOA CTT0C3OCAOC 
26401 ACCOCCTOCA CCOATCCOGC CAAOOOOCCT ACCACCTTOC COQTOOA00C ACCGCGQCTT 
26461 GAQGAOGAGG AACTCATTAT CX2A0CA9GAC CCMOTTTM TAAOCOAAOA OOAOOAOOAT 
26521 OOCTCAGTAC CAACAGAOGA 1AAAAA0CAA GACCAOQACG AOCGAOA00C AAAOGAOOAA 
26581 CAAGTOGOOC QQQQQQAOCA AAOGCATOGC OACtACCTAQ ATOTOGGAGA COAOSTOCTO 
26641 TTOAAGCAIC TOCAOCGCCA GTCCX3CCATT ATCTOOGACO COTTOCAAOA OCOCAOCOAT 
26701 GXtXXXXTCC CCATAOCOOA TOTCA0CCTT OCCIAOGAAC CCCACCTOTT CTCACCO0GC 
26761 OTACCCCCCA AACOCGAAOA AAACOOCACA TOOOAOCCCA A CCCQCQC CT CAACTTCXAC 
26821 OO00TATTTO COOTOOCAGA QQ TO CT TO OC AOCTATCACA TCTTTTTCCA AAACTQCAAG 
26881 ATAOCOCTAT CCTOCCOTOC CAACCQCAOC COAGCOGACA A0CX0CTO9C CWOCOOCAO 
26941 OOOQCTOTCA TACCT3ATAT O30CTCQCT C GAGGAAGTOC CAAAAATCTT TOAOOOICTT 
27<M>1 OGAOGQ2ACO AOAAAOQOQC OOCAAACGCT CIOCAACAM AAAACA0OO* AAASOAAACT 
27061 CACTOTOOAO TQCTOQTOQA ACTTOAOCGT OACAAOOCOC O0CTAQ000* OCTOAAAOGC 
27121 AOCATCGAGG TCACCCACTT TOCCtAOCOO OCACTTAACC TACCCCCCAA OQTTKSOMC 
27181 ACAOTCATOA OCOAOCTOA? COTOOQC O Q T OCACCAOCCC TOQM3AO0QA TGCAAACTTO 
27241 CAAGAACAAA COGAOGAOQO CCTACCCOCA GTTOOCOATO A0CMCTO9C OOOCfOOCTT 
27301 CAOAOQOOOO AOOCTCCCOA CTTOOA0OA0 OQAOQCAAOC TAATCATCJOC COCAOTOCTT 

27361 orxAOooroo McnoHna casocaococ ttdtttocto AOCOQQAQAT OCA0OQCAAC 

27421 CTM3A0GAAA OGTTOCACTA CAOCTTTCOC CAMQC1AOG TQCQCCAOQ C CTOCAXXA'W 
27481 1CCAAOOTOC AOCTCTOCAA CCTOCTCTOC TACCTTOGAA TTTTOCAOGA AAACCOCCTC 
27541 OOOCAAAACO tOCTTCATTC CACOCTCAA3 OOCOAMOOC QCCQCQACTA OOTOCQO QA C 
27601 TOOOTTTACT TATTTCTOTO CTACACCTCW CAAACMCCA TOOOCGT3TG OCAOCAATOC 
27661 CTOGAflQAGC OCAACCTAAA OGMCTOCA0 AAOCTOCTAA AOCAAAACTO OAAOGACCTA 
27721 TOGACOGCCT TCAAOfiMOG CTOOOTQ00C OCOCACCTW OQGACAMAT CTXIXXJOOAA 
27781 OOOCTQprXA AAACCCTOCA ACAOCXHCTO CCAQACTICA (XAGTCAAAO CATSTTOCAA 
27841 AACTTTAOOA ACTTOA3CCT AOACCOTJCA OOAATTCTOC COOOCACCTO CTOTOCOCTT 
27901 CCTAOCOUCT TTOTOCCCAT TAAOTAOOOX OAATOCCCTC C OCC0CTT TO OCX3TCACTOC 
27961 TACCTTCTQQ ACCTAOCCAA CTACCTTOCC TACCACTCOa ACATCATOOA AOACCTCACC 
28021 OCTOAOOOCC TACTOC2MT8 TCACTOTDOC TOCAACCTAT OCAC COCQCA OCOCTCCCTC 
28081 OTCTCCAATT COCAACTOCT TAQOQAAAQT CAAATTATOO OTACCTTTCA OCTOCAOOOT 
28141 CCCTOOCCTQ ACGAAAAOIC COCOOCTCCO OOCTTCAAAC TCACXCOOOG OCTOTQGACC 
28201 1XXGCTTACC TTOCCAAATT TOTACCTOAO OACTACCAOO COCACQA0AT TAOOTTCTAC 
28261 GAACACCAAT CCOCCCOOCC AAATOCOOA0 CTTACOOCCT OOOTCATTAC OCMOQCCAC 
28321 ATCCTTOQCC AATTOCAAOC CATCAACAAA CCCCOCCAAQ ACTTTCTQCT AOCAAAQQGA 
28381 OOOOOOOTTT AOCTOGACOC CCAOTCCOOC GMXZAOCTCA ACCCAATCCC OOCOOQQOOQ 
28441 CMCCCTATC AOCAOCCOCG QGCCCTTOCT TCOCMOATO OCACCCAAAA ACAAOCTGCA 
28501 CCTQCCGCOG CCCTCACCCA COQACOA90A 0GAATACTO0 OACACTCAOG CACAOGAOCT 
28561 T T TOCA 0QM QAQQAOGAQA TOATOGAACA CTOOQACACC CTAOAOCAAG 
28621 OGAAQAGCTG TCAOACGAAA dAOCOTCAtC CTOQOTOOCA TTCCCCTCOC OOOCOCCOCA 
28681 GAAATTOC3CA ACCOTTCCCA OCATOOCTAC AACCTCCGCT CXTICAOCCOC OCCCQGCACT 
28741 OCCraiTCOC COACCCAACC OTACA3XXX2A CACCACTOGA ACCAGC3GCCG CTAAGTCTAA 
28801 GCAOOCOCCC COOTTAOCCC AAOAOCAACA ACAOOOQCAA O0CTAO09CT OOTOOCOCOC 
28861 CCACAAGAAC GCCATAGTTO CTTOCT TO CA ACACTCTOCW OOCAACATCT CCTTOOCCCC 
28921 CCOCTTTCTT CTCTACCATC ACOOCCTCOC CXTCCCCCGT AACATCCTOC ATTACTACCG 
28981 TCATCTCTAC AGCCCCTACT GCACOC5CCO0 CAOCOOCAGC OQCAOCAACA OCAOOGGTCA 
29041 CACAGAAOCA AAOGCGACCG GAXAQCAAOA CTCTSACAAA GCCCAAOAAA TCCACAGCGG. 
29101 COOCAGCAGC AGGAGGAGGA GCOCTCCGTC TSC3CGCCCAA COAACCCOTA TCGACCCQCG 
29161 AGCTTAGAAA TAGOATTTTT CCCACTCTCT ATOCTATATT TCAACAAAOC AGOGOCCAAG 
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29221 AACAAOAOCT GAAAATAAAA AACAOOtCT C 70CGCT0CCT CA CCC CCAftr TOCCTCTATC 
29261 ACAAAAQCOA AQASCAGCTT C0OOQCAO0C TOOAAflACOC OQAOQCTCTC TOCAOCAAAT 
29341 ACTOOQCOCT OACTCTTAM3 OACTACTTIC OCOCCCTTTC TCAAATTTAA OOOOa A AAAC 
29401 TAOOTCATCT CCAOCOOCCA CACCOOOOOC CAOCACCTOT COTCAGOOOC ATCATOAOCA 
29461 AOOAAATICC CACOCCCTAC ATCTOttAOTT ACCAOCCACA AATCOCACTT GOQQCTOOAG 
2952 X CTOCCCAAOA CXACTCAACC COAATAAACT ACAIOAOOOC OOQACCOCAC ATOATATOCC 
29S31 GQ81CAAOQO AATCCOOOCC CACOOAAACC CAATTCTCCT COAACAOOCO OCTATTACCA 
29641 CCACACCTCQ TAATAACOTT AAXCCCCQTA OTTOOCCCOC TOCCCTGWTO TACCAOOAAA 
29701 -BXCCCQCTCC CACCACTOTC OTACTXCCCA OACAOCCCCA aXXOAAOTT CAGATOACTA 
29761 ACTCAOOOOC OCAQCTTOCO OOOOOCTPTC OTCACAOOOT GCOOTCOCCC OOOCAOGOTA 
29821 TAAC7CACCT GAAAATCAflA QQQOQAQQTA TTCAOCTCAA COACQA CTOQ OTCACCTOCT 
29881 CTCTTOOTCT COCTCOOGAC G03ACATTTC ACATOGOCaQ COCTOGCOGC TCTTCATTTA 
29941 CQCCCCOTCA <MOOATCCTA ACTCTOCAGA CCTOOTCCTC GQAGCCOOQC TCCGGAOOCA 
30001 TTOOAACICT ACAATTTATT GAQGAOTTOO WOCTICQCSr TTACTTCAAC CCCTMTCTG 
20061 CACCTOOCOG CCACTACCCO OACCAGTTTA TTCCCAACTT TCACOO QOTO AAAGACTCOG 
20121 ajOAOOOCTA CGACTGAAT8 ACCAflKWAG AOOCAGAGCa AC TOOOCCTO A CACACC TCO 
30181 ACCACTOCCO CCOOCACAAO TGCTTTQCCC 0CO0CTCOC5Q TOAGTTTTGT TAC^TTOAAT 
30241 TOCCCGAAGA GCATATOGAQ ©GCCCGCCOC AOQOCOTOCC OCTCACCACC CAOOTOOAOC 
30301 TTACACCTAO CCTOATTOGO GAOTTTACCA AOOQCC CO C T GCTAOTQCAG CQOOAOCO GG 
30361 GTCCCTCWXST TCTOACOCTO OTTTOCAACT GTCCTAACCC TOQATTACAT CAAOATCTTT 
30421 OTCGTCATCT CTCTCCTOAQ TAXAATAAA9 ACAOAAATTA GAATCTACTO OOOCTCCTOT 
30481 COCGATQCTO TOAACGCCAC OOTTTITACC CAOOCAAAOC AGACCAAAOC AAACCTCACC 
30S41 TO0GCTTTOC ACAAGOQOOC CAAXAAGTAC CTTACCTOQT ACOTJAAOOO CTCTTCATTT 
30601 OtAATTTACA ACAOTTTCCA OOOMOACOAA CTAAGTTTOC CACACAACCT TCTOOGCTTC 
30661 AACTACACCO ICAACAAAAA CACCACCACC ACCACCCTCC TCACCTOCOO OOAACOTAOG 
.20721 AOTOCOTCAC COGTTOCTOC OCCCACACCT ACAOCCTOAQ CCTAACCAOA CATT ACTCOC 
30781 ATTTTTCCAA AACAOOAOOT OAOCTCAACT CCOOQAACTC AGCTCAAAAA AOCAT TT T CC 
30941 CXXX3TOCTOO GATTTTTTAA TTAAOTATAT OAGCAATTCA ACTAACTCTA CAAOCTTOIC 
3O901 TAATTTTXCT OGAATTGOGG TCOGQGTTAT CCTTACTCTT OTAAOTCTOT TTAOTCTTAT 
30961 ACTAGCACT? CTCTOCCTTA OOSWOCCOC CTOCTOCACC CAOOTTTOTA OCTATTO TCA 
31021 QC W CTTA AA COCTGGGOGC AACATCCAAO ATGAOQTACA TOATTTTAOQ CTTO CTCGCC 
31081 CTTOCOGCAO TCTOCAGCGC TOCCAAAAAC GTTOAOTTTA AOGAACCAOC TO3CAATOTT 
31141 ACATTTAAAT CAOAAOCTAA TOAATOCAC? ACTCTTATAA AATOCACCAC AG AACAT8A A 
31201 AACCTTATEA TTCGCCACAA AGACAAAATT OGCAACTATO CTQTATATOC TATTT0OCAO 
31261 CCAGCTGACA CTAAOQACTA TAATCTCACA OICTTCCAAC CXOAAAATCO TAAAACTPPT 
31321 ATOTATAAAT TTCCATTTTA TOAAATCTOC CATATTAOCA TOTACAT CAG CAAACAQTAC 
31381 AAOTICTOOC CCCCACAAAA OTOTTTAOAC AACACTOOCA CCTTTICTTC CACCQCTCTG 
31441 CTTATTACAC COCTTCCTTT QCTATOTACC TTACTTTATC TCAAATACAA AA QCAQA COC 
31501 ACTTTTATTC ATSAAAASAA AATCCCOTCA TTTTCCGCTT (OTTOTA3TC CCCTOOACAA 
31561 TTTACTCTAT CTCSCOATATC CTCCAOOOOC OCAAOATTAT AOCCACAACC TTCAAATCAA 

31621 ACTTTCCIOO accttagccc ctoatttctc ccaococctc cactocaaat ttqatcaaac 

31681 CCAOCrtCAO CTTOCCTCCT CCAOAOATSA ODOOCTCAAC CATOBCOCCC ACAACOOACT 

"J" SSSSS SSSaS ??SoacatcJ SScttttc CATAOCOCTT ATOTTTOTTT 

31861 OCcSaTTAT TATOTOGCTT A1TTOTTCCC TAAAOCOCAC ACOCOCCAOA CCCCCCATCT 
31921 ATAOOCCTAT CATTCTOCTC AACCCACACA ATOAAAAAAT TCATACATTG OACOGTCTGA 
31981 AACCATOTTC TCTTCTTTTA CAOTATCATT AAATCACACA TOATTCCT0C AOTTCTTATA 
32041 TTATTGACCC TTGTTQCOCT TTTCTGTOCG TGCTCTACAT TCX3CCOCOGT CGCTCACATC • 
32101 OAAOTAOATT OCATCCCACC TTTCACACTT TACCTOCOTT ACCGATTTCT CACCCTTATC 
32161 CICATCTOCA OCCTCGTCAC TOTAOTCATC CCCTTCATTC AOTTCACTOA CTOOOTTTOT 
32221 GtoSStW COTACCTCAO OCACCATCCC CA ATACAflAO ACAOOACTAT AOCTGATCTT 
32281 CTCAGAATTC TTTAATTATC AAACGGAGTO TCAITTriCT TTTCCTCATT TTTTCCGCCC 
• 32341 TACCTCTOCT TTGCTCCCAA ACCTCAOCGC CTTCCCAAAAG ACATATTTCC TOCACA1TCA 
32401 CTCAAATATG GAACATTCCC ACCTCCTACA ACAAACAOAO COATTTOTCA OAACCC1X5CT 
32461 TATACGCCAT CATCTCTGTC ATOGTnTTT OCAOTACCAT TTTTOCCCTA GCCATATATC 
32S21 CATACCITCA CATTGGCTGG AAT5CCATAC ATSCCATCAA CCACCCTACT TTCCCACTgC 

lllli SSSSi ScacScaa caogttattc ccccaatcaa tcaocct>coc cccccrrcrc 
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32641 CCXCOCCCAC TOAGATTAOC TACTTTAATT TOACAGCTQG AGATOACTOA ATCTCTAOAT 
32701 CTAGAATTOG ATOGAATTAA CACCGAACAG CGCCTACTAG AAAOGCDGAA OGCQOOGTOC 
32761 GAGOGAGAAC GCCTAAAACA AGAAGTXGAA QACATOOTCA ACCTACACCA CTCTAAAAOA 
12821 GGTATCTTTT OTOTOOTCAA OCMOCCAAA CTTACCTAOO AAAAAACCAC TACOGGCAAC 
32881 COOCTCAGCT ACAAOCTACC CAOCCACOOC CAAAAACtOO TOCTT A TOOT OOOAOAAAAA 
32941 CCTATCACCG TCACCCAGCA CTOGOCAGAA ACAQAOOOCT OCCTOCACTT OCOCTAJCAG 
33001 OCTTCCAOAOO ACCTCTCCAC TCTTATTAAA ACCATOTOTO CTATTAOAOA TCTTATTCCA 
33061 TTCAACTAAC ATAAACACAC AATAAATTAC TTACTTAAAA TCACTCAQCA AATCTTTGTC 
33121 CAGCTTATTC AOCATCACCT OCT CT CCTT C CTCCCAACTC TOOTATCTCA OCCQ CCTTCT 
33181 AQCTGCAAAC TTCKTICCAAA GTTTAAATOO OA3CTCAAAT TCCTCATOTT CTTOTOCCTC 
33241 COCACCGACT ATCTOCATAT TOTTOCA3AT 6AAAC00QCC AGACCOTCTC AAGACACCTT 
33301 CAACCCOGTC TATCCATATO ACACAGAAAC CGQOCCTCCA ACTOTCCCCT TTCTTACOCC 
33361 TCC ATTTOTT TCACCCAATO GTTTOCAAGA AACTCCCCCT CXttGTTCTCT CTCTAOGOGT 
33421 CTCCGAACCT TTOOACACCT CCCAOQQCAT GCTTOCOCTT AAAATOGGCA OCGGTCTTAC 
33481 CCTAOACAA0 GCCOGAAACC TCACCTOCCA AAATOTAACC ACTCTTACTC A0CCACTTAA 
33541 AAAAACAAA0 TCAAACATAA GTTTOGACAC CTCCGCACCA CTTACAATTA CCTCAOOCCC 
33601 CCTAACAGTO OCAACCACOG CTOCTCTOAT AGTTACTAOC OGOQCTCTTA OCGTACAGTC 
33661 ACAAOCCCCA CTOACOOTOC AAGACTOCAA ACIAAQCATT GCTACTAAAG GOCCCATTAC 
33721 AGTOTCAGAT GCAAAGCTAfl CCCTOCAAAC ATCAOCCCCC CTCTCTGGCA GTGAGAGOGA 
J3T81 CACCCTIACT OTAACTCCAT CAOCCCOOCT AACTACTOOC AOOOSTMCT TOQQCATTAA 
33841 CATOGAAOAT CCTMTTATO TAAATAATOO AAAAATAOGA ATTAAAATAA GOOOTCOTPT 
33901 GCAAOTACCA CAAAACTCOO ATACACTAAC ACTAOTTACT OOACCAGOTG TCACOGTTUA 
33961 ACAAAACTOC CTTACAAOCA AAOTTOCAOG AQCTATTOOT TATOATTCAT CAAACAACAT 
34021 GOAAATTAAA A CO OQ CO GTO OCATOOQTCAT AAATAACAAC TTOTTAATTC TAGATCTQCA 
34081 TXACCCATTT GATCOTCAAA CAAAACtAOG TCTTAAACTO OQQCA 0QQAC OCCTGTATAT 
34141 TAATCCATCT CATAACTT30 ACATAAACTA TAACAOAOOC C TATAC CTTT T TAATQ CATC 
34201 AAACAAXACT AAAAAACTGG AAOTtAGCAT AAAAAA ATCC AGTGCACTAA A CTTTO A3CAA 
34261 TACraCCATA OCTATAAATO CAQGAAAOOG TCTOOAOTTT GATACAAACA CATCTOAGTC 
34321 TCCAOATATC AACCCAATAA AAACTAAAAT IGQCfCTGQC ATTCATTACA ATQAAAAOOO 
34381 TOCCATOACT ACTAAACTTO QAOOQQQTTT AA0CTTOCAC AACTCAOQOG CCATTACAAT 
34441 AOOAAACAAA AATOATQACA AACMACCCT C TOGACA ACC CCAGACCCAT CTCCTAACTO 
34501 CAOAATTCAT TCAQATAATO ACTflCAAATT TACTTTOOTT CTTCAC AAAAT CTOOOAGTCA 
34561 AOTACTAGCT ACTOTAGCTG C T T XP fl CTOT ATCTGGAGAT CTTTCATCCA TGACAQGCAC 
34621 COTTOCAAQT GTTAOTATAT TCCWACATT TOACCAAAAC OOTOTTCTAA TOGAOAAC1C 
34681 CTCACTEAAA AAACATTACT OGAACTTWG AAAT99QAAC TCAACTAATQ CAAATOCATA 
34741 CACAAATOCA GTTGGATTTA TOCCTAACCT TCTAQCCTAT CCAAAAACCC AAAGTCAAAC 
34801 TOCTAAAAAT MCATTOTCA GTCAACTTTA CTTOCATOCT CATAAAACTA AACCT ATOAT 
34861 ACTTACCATT ACACTTAATO GCACTACTCA ATCCACASAA ACTAOCQAOO TAAQCAC 1TA 
34921 CTCTATCTCT 0TTACATOC3T CCTO00AAA3 TOOAAAATAC AOCACTQ AAA CTTTTGCTAC 
34981 CAACTCTTAC ACCTTCTCCT ACATTOCCCA 0GA*CAAAOA ATCCTCAACC TOTTOCATCT 
35041 TATOTTTCAA COTOGGATCC TTTATTATAC OGCAAQTCCA COCCTACATS QQQGTXGhG? 
35101 CATAATCCTG CATCAOGATA OOOCOCTTOGT CCTOCAOCAO COCOCOAATA AACTOCTOCC 
35161 GCOGCiOGCTC CCTCCTCCAO OAATACAACA TOOCAOTOCT CTCCTCAOCO ATCATTCGCA 
35221 CCCCCCOCAO CATOAOACCC CTTOTCCTCC GOC5CACASCA OCOCACCCTO ATCTCACTTA 
35281 AATCAGCACA GTAACTQCAC CACAGCACCA CAATATTOTT CAAAATCCCA CAOTGCAACG 
35341 CGCTOTATCC AAAGCTCATC GOOaXttCCA CAGAACCCAC OTOOCCATCA TA OCACAAGC 
35401 GCAOQTAGAT TAAGTOOOGA CCXCTCATAA ACACCCTCCA CATAAACATT ACCTCTTTTC 
35461 QCATQTTO TA ATTCACCACC TCCCOCTAOC ATATAAACCT CTOATTAAAC ATOGCGCCAT 
35521 CCACCACCAT CCTAAACCAG CTQOCCAAAA CCTGCCCGCC GOCTATCCAC TCCAOGGAAC 
35581 COGGACTGGA ACAATCACAG TOGAQAGCCC AOGACTCCTA ACCATOGATC ATCATGCTCO 
35641 TCATGATATC AATCTTGOCA CAACACAOGC ACACOTQCAT ACACTICCTC AOGATTACAA 
35701 GCTCCTCCOG CGTCAGAACC ATATCCCAGG GAACAACCCA TPCCTGAATC AGCGTAAATC 
35761 CCACACTOCA OOQAAGACCT CGCACGTAAC TCACCTTCTG CATTGTCAAA GTOTTACATT 
35821 CGGGCAGCAG CGGATCATCC TCCAGTATGG TAGCOCOGGT CTCTOTCTCA AAAGCAGCTA 
35881 GGCGATCCCT ACTGTACGGA GTGOGCCGAG ACAACCGAGA TCGTCTTGGT CGTAGTCTCA 
35941 TCCCAAATOG AAOGCCGGAG GTAGTCATAT TTCATCCACA COGCACCAGC TCAATCACTC 
36001' ACAGTGTAAA AAGGGCCAAG TACAGAGCGA GTATATATAG GACTAAAAAA TGACGTAACG 
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Nucleotide Sequence Analysis (conL) 

36061 <3TTAAAGTCC ACAAAAAACA CCCAGAAAAC CCCACGOGAA CCTACOCCCA GAAACGAAAG 
36121 CCAAAAAACC CACAACTTCC TCAAA1CTTC ACTTCCOTTT TCCGACGATA COTCACTICC 
36181 CATTTTAAAA AAACTACAAT TCCCAATACA TOCAAGTTAC TCOOCCCTAA AACCTAOOTC 
36241 ACCOOOCCOO TTCCCACOCC CCCOCOCACG TCACAAACTC CACCCCC1CA TTfcTCATATT 
36301 OGCTTCAATC CAAAATAA0C TATATTATCA TCATO 
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(1) GENERAL INFORMATION: 

(i) APPLICANTS: Gregory, R.J., Armentano, D. , Couture, L.A., Smith, 
A.E. 

(ii) TITLE OP INVENTION: GENE THERAPY POR CYSTIC FIBROSIS 



(iii) NUMBER OF SEQUENCES: 9 



(iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESS BE : LAHIVE & COCKFIELD 

(B) STREET: 60 STATE* STREET,* SUITE 510 

(C) CITY: BOSTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 
20 (F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-D08/MS -DOS 

(D) SOFTWARE: ASCII 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
30 (B) FILING DATS: 02 -DEC- 1993 

(C) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/965,478 
35 (B) FILING DATS: 02-DEC-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Hun ley, Elizabeth A. 

40 (B) REGISTRATION NUMBER: 33,505 

(C) REFERENCE /DOCKET NUMBER: NZI-014CP2PC' 

(ix) TELECOMMUNICATION INFORMATION : 
(A) TELEPHONE: (617) 227-7400 
45 (B) TELEFAX : (617) 227-5941 



(2) INFORMATION POR SBQ ID NO:l: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 



55 



(ii) MOLECULE TYPE: cDNA 



-97- 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 133.. 4572 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AATTGGAAGC AAATGACATC ACAGCAGGTC AGAGAAAAAG GGTTGAGCGG CAGGCACCCA 60 
GAGTAGTAGG TCTTTGGCAT TAGGAGCTTG AGCCCAGACG GCCCTAGCAG GGACCCCAGC 120 



10 



GCCCGAGAGA CC ATG CAG AGG TCG CCT CTG GAA AAG GCC AGC GTT GTC 168 
Met Gin Arg Ser Pro Leu Glu Lys Ala Ser Val Val 
15 1 5 10 



20 



TCC AAA CTT TTT TTC AGC TGG ACC AGA CCA ATT TTG AGG AAA GGA TAC 216 
Ser Lys Leu Phe Phe Ser Tip Thr Arg Pro lie Leu Arg Lys Gly Tyr 
15 20 25 

AGA CAG CGC CTG GAA TTG TCA GAC ATA TAC CAA ATC CCT TCT GTT GAT 264 
Arg Gin Arg Leu Glu Leu Ser Asp lie Tyr Gin lie Pro Ser Val Asp 
30 35 40 

25 TCT OCT GAC AAT CTA TCT GAA AAA TTG GAA AGA GAA TGG GAT AGA GAG 312 
Ser Ala Asp Asn Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu 
45 50 55 60 

CTG OCT TCA AAG AAA AAT CCT AAA CTC ATT AAT GCC CTT COG COA TGT ...... 360 

30 Leu Ala Ser Lys Lys Asn Pro Lys Leu lie Asn Ala Leu Arg Arg Cys 

65 70 75 

TTT TTC TGG AGA TTT ATG TTC TAT GGA ATC TTT TTA TAT TTA GGG GAA 408 
Phe Phe Trp Arg Phe Met Phe Tyr Gly lie Phe Leu Tyr Leu Gly Glu 
35 60 85 90 

GTC ACC AAA GCA GTA CAG CCT CTC TTA CTG GGA AGA ATC ATA OCT TCC 456 
Val Thr Lys Ala Val Gin Pro Leu Leu Leu Gly Arg He lie Ala Ser 
95 100 105 

40 

TAT GAC CCG GAT AAC AAG GAG GAA CGC TCT* ATC GCG ATT TAT CTA GGC 504 
Tyr Asp Pro Asp Asn Lys Glu Glu Arg Ser He Ala He Tyr Leu Gly 
110 115 120 

45 ATA GGC TTA TGC CTT CTC TTT ATT GTG AGG ACA CTG CTC CTA CAC CCA 552 
He Gly Leu Cys Leu Leu Phe He Val Ar^ Thr Leu Leu Leu His Pro 
125 ia0 135 140 # 

GCC ATT TTT GGC CTf*AT CAC ATT GGA ATG CAG ATG AGA ATA GCT ATG 600 
50 Ala lie Phe Gly Leu His His He Gly Met Gin Met Arg He Ala Met 

145 150 155 

TTT AGT TTG ATT TAT AAG AAG ACT TTA AAG CTG TCA AGC CGT GTT CTA 648 
Phe Ser Leu He Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu 
55 160 165 170 * 
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GAT AAA ATA AGT ATT GGA CAA CTT GIT AGT CTC CTT TCC AAC AAC CTG S9S 

Asp Lys lie Ser He Gly Gin Leu Val Ser Leu Leu Ser Asn Asn Leu 
175 180 185 

5 

AAC AAA TTT GAT GAA GGA CTT GCA TTG GCA CAT TTC GTG TGG ATC GCT 744 

Asn Lys Phe Asp Glu Gly Leu Ala Leu Ala His Phe Val Trp He Ala 
190 195 200 

10 CCT TTG CAA GTG GCA CTC CTC ATG GGG CTA ATC TGG GAG TTG TTA CAG 792 
Pro Leu Gin Val Ala Leu Leu Met Gly Leu He Trp Glu Leu Leu Gin 
205 210 215 220 

GCG TCT GCC TTC TGT GGA CTT GGT TTC CTG ATA GTC CTT GCC CTT TTT 840 
15 Ala Ser Ala Phe Cys Gly Leu Gly Phe Leu He Val Leu Ala Leu Phe 

225 230 235 . 

CAG GCT GGG CTA GGG AGA ATG ATG ATG AAG TAC AGA GAT CAG AGA GCT 888 
1 Gin Ala Gly Leu Gly Arg Met Met Met Lys Tyr Arg Asp Gin Arg Ala 
20 240 • ' 245 250 

GGG AAG ATC AGT GAA AGA CTT GTG ATT ACC TCA GAA ATG ATT GAA AAT ' 936 
Gly Lys He Ser Glu Arg Leu Val He Thr Ser Glu Met He Glu Asn 
255 260 265 

25 

ATC CAA TCT GTT AAG GCA TAC TGC TGG GAA GAA GCA ATG GAA AAA ATG 984 
He Gin Ser Val Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met 
270 275 280 

30 ATT GAA AAC TTA AGA CAA ACA GAA CTG AAA CTG ACT CGG AAG GCA GCC 1032 
He Glu Asn lieu Arg Gin Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala 
285 290 295 300 

TAT GTG AGA TAC TTC AAT AGC TCA GCC TTC TTC TTC TCA GGG TTC TTT 1080 
35 Tyr Val Arg. Tyr Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe 

305 310 315 

GTG GTG TTT TTA TCT GTG CTT CCC TAT GCA CTA ATC AAA GGA ATC ATC \l2S 
Val Val Phe I-eu Ser Val Leu Pro Tyr Ala Leu He Lys Gly He He 

40 /■ 320 • 325 330 

CTC CGG AAA ATA TTC ACC ACC ATC TCA TTC TGC ATT GTT CTG CGC ATG 1176 

Leu Arg Lys He Phe Thr Thr He Ser Phe Cys He Val Leu Arg Met 
335 340 345 

45 

GCG GTC ACT CGG CAA TTT CCC TGG GCT GTA CAA ACA TGG TAT GAC TCT 1224 

Ala yal Thr Arg Gin' Phe Pro Trp Ala Val Gin Thr Trp Tyr Asp Ser 
.350 355 360 

50 CTT GGA GCA ATA AAC AAA ATA CAG GAT TTC TTA CAA AAG CAA GAA TAT 1272 
Leu Gly Ala He Asn Lys He Gin Asp Phe Leu Gin Lys Gin Glu Tyr 
365 370 375 380 

AAG ACA TTG GAA TAT AAC TTA ACG ACT ACA GAA GTA GTG ATG GAG AAT 1320 
55 Lye Thr Leu Glu Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn 

385 390 395 
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GTA ACA GCC TTC TGG GAG GAG GGA TTT GGG GAA TTA TTT GAG AAA GCA 1368 
Val Thr Ala Phe Trp Glu Glu Gly Phe Qly Qlu Leu Phc Glu Lya Ala 
400 405 410 

5 

AAA CAA AAC AAT AAC AAT AGA AAA ACT TCT AAT GGT GAT GAC AGC CTC 1416 
Lys Gin Asn Asn Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu 
415 420 425 

10 TTC TTC AGT AAT TTC TCA CTT CTT GGT ACT CCT GTC CTG AAA GAT ATT 1464 
Phe Phe Ser Asn Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp lie 
430 435 440 

AAT TTC AAG ATA GAA AGA GGA CAG TTG TTG GCG GTT OCT GGA TCC ACT 1512 
15 Asn Phe Lys He Glu Arg Gly Gin Leu Leu Ala Val Ala Gly Ser Thr 
445 450 * 455 »*fcr 460 

GGA GCA GGC AAG ACT TCA CTT CTA ATG ATG ATT ATG GGA GAA CTG GAG 1560 
Gly Ala Gly Lys Thr Ser Leu Leu Met Met lie Met Gly Glu Leu Glu 
20 465 470 475 

; 

CCT TCA GAG GGT AAA ATT AAG CAC AGT GGA AGA ATT TCA TTC TGT TCT 1608* 
Pro Ser Glu Gly Lys lie Lya His Ser Gly Arg He Ser Phe Cys Ser 
480 485 490 

25 

CAG TTT TCC TGG ATT ATG CCT GGC ACC ATT AAA GAA AAT ATC ATC TTT 1656 
Gin Phe Ser Trp lie Met Pro Gly Thr He Lye Glu Asn He He Phe 
495 500 505 

30 GGT GTT TCC TAT GAT GAA TAT AGA TAG AGA AGC GTC ATC AAA GCA TGC 1704 
Gly Val Ser Tyr Asp Glu Tyr Arg Tyr Arg Ser Val He Lys Ala Cys 
510 515 520 

CAA CTA GAA GAG GAC ATC TCC AAG TTT GCA GAG AAA GAC AAT ATA GTT 1752 
35. Gin Leu Glu Glu Asp He Ser Lys Phe Ala Glu Lys Asp Asn He Val 
525 530 535 540 

i 

CTT GGA GAA GGT GGA ATC ACA CTG AGT GGA GGT CAA CGA GCA AGA ATT 1800 ' 

Leu Gly Glu Gly Gly He Thr Leu Ser Gly Gly Gin Arg Ala Arg He 
40 " 545 550 555 

TCT TTA GCA AGA GCA GTA TAC AAA GAT GCT GAT TTG TAT TTA TTA GAC 1848 
Ser Leu Ala Arg Ala Val Tyr Lys Asp Ala Asp Leu Tyr Leu Leu Asp 
560 565 570 



45 



TCT CCT TTT GGA TAC CTA GAT GTT TTA ACA GAA AAA GAA ATA TTT GAA * 1896 
Ser Pro Phe Gly Tyr Len Asp Val Leu Thr Glu Lys Glu lie Phe Glu 
575 580 / 585 



50 AGC TGT GTC TGT AAA CTG ATG GCT AAC AAA ACT AGG ATT TTG GTC ACT 1944 
Ser Cys Val Cys Lys Leu Met Ala Asn Lys Thr Arg He Leu Val Thr 
590 595 600 

TCT AAA ATG GAA CAT TTA AAG AAA GCT GAC AAA ATA TTA ATT TTG CAT 1992 
55 Ser Lys Met Glu His Leu Lys Lys Ala Asp Lys He Leu He Leu His 
605 610 615 620 



WO 94/12449 




PCT/US93/11667 



GAA GOT AGC AGC TAT TTT TAT GGG ACA TTT TCA GAA CTC CAA AAT CTA 2040 
Glu Gly Ser Ser Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gin Asn Leu 
625 630 635 

5 

CAG CCA GAC TTT AGC TCA AAA CTC ATG GGA TGT GAT TCT TTC GAC CAA 2088 
Gin Pro Asp Phe Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gin 
640 645 650 

10 TTT AGT GCA GAA AGA AGA AAT TCA ATC CTA ACT GAG ACC TTA CAC CGT 2136 
Phe Ser Ala Glu Arg Arg Asn Ser lie Leu Thr Glu Thr Leu Hie Arg 
655 660 665 

TTC TCA TTA GAA .GGA GAT GCT CCT GTC TCC TOG ACA GAA ACA AAA AAA 2184 
15 Phe Ser Leu Glu Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys. 
670 t# 675 680 

CAA TCT TTT AAA CAG ACT GGA GAG TTT GGG GAA AAA AGG AAG AAT TCT 2232 
Gin Ser Phe Lys Gin Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser 
20 685 690 * 695 700 

ATT CTC AAT CCA ATC AAC TCT ATA CGA AAA TTT TCC ATT GTG CAA AAG 2280 

lie Leu Aan Pro He Asn Ser He Arg Lys Phe Ser He Val Gin Lys 

705 710 715 

25 

ACT CCC TTA CAA ATG AAT GGC ATC GAA GAG GAT TCT GAT GAG CCT TTA 2328 

Thr Pro Leu Gin Met Asn Gly He Glu Glu Asp Ser Asp Glu Pro Leu 
720 725 730 

30 GAG AGA AGG CTG TCC TTA OTA CCA GAT TCT GAG CAG GGA GAG GOG ATA 2376 
Glu Arg Arg Leu Ser Leu Val Pro Asp Ser Glu Gin Gly Glu Ala He 
735 740 745 

CTG CCT CGC ATC AGC GTG ATC AGC ACT GGC CCC ACQ CTT CAG GCA CGA 2424 
35 Leu Pro Arg He Ser Val He Ser Thr Gly Pro Thr Leu Gin Ala Arg 
750 755 760 

AGG AGG CAG TCT GTC CTG AAC CTG ATG ACA CAC TCA GTT AAC CAA GOT 2472 
Arg Arg Gin Ser Val Leu Asn Leu Met Thr His Ser Val Asn Gin Gly 
40 765 770 775 780 

CAG AAC ATT CAC CGA AAG ACA ACA GCA TCC ACA CGA AAA GTG TCA CTG 2520 
Gin Asn He His Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu 
785 790 795 

45 

GCC CCT CAG GCA AAC TTG ACT GAA CTG GAT ATA TAT TCA AGA AGG TTA 2568 
Ala Pro Gin Ala Asn Leu Thr Glu Leu Asp He Tyr Ser Arg Arg Leu 
800 805 810 

50 TCT CAA GAA ACT GGC TTG GAA ATA AGT GAA GAA ATT AAC GAA GAA GAC 2616 
Ser Gin Glu Thr Gly Leu Glu He Ser Glu Glu He Asn Glu Glu Asp 
815 820 825 

TTA AAG GAG TOC CTT TTT GAT GAT ATG GAG AGC ATA CCA GCA GTG ACT 2664 
55 Leu Lys Glu Cys Leu Phe Asp Asp Met Glu Ser He Pro Ala Val Thr 
830 835 840 
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ACA TGG AAC ACA TAC CTT CGA TAT ATT ACT GTC CAC AAG AGC TTA ATT 
Thr Trp Asn Thr Tyr Leu Arg Tyr lie Thr Val Hie Lys Ser Leu He 
845 850 855 860 



2712 



TTT GTG CTA ATT TGG TGC TTA GTA ATT TTT CTG GCA GAG GTG GCT GCT 
Phe Val Leu He Trp Cys Leu Val He Phe 1 Leu Ala Glu Val Ala Ala 
865 870 875 

10 TCT TTG GTT GTG CTG TGG CTC CTT GGA AAC ACT CCT CTT CAA GAC AAA 
Ser Leu Val Val Leu Trp Leu Leu Gly Asn Thr Pro Leu Gin Asp Lys 
680 885 890 



2760 



2808 



GGG AAT AGT ACT CAT AGT AGA AAT AAC AGC TAT GCA GTG ATT ATC ACC 
15 * Gly Asn Ser Thr His Ser Arg Asn Asn Ser Tyr Ala Val He He Thr 
895 900 905 



2856 



20 



AGC ACC AGT TCG TAT TAT GTG TTT TAC ATT TAC GTG GGA GTA GCC GAC 
Ser Thr Ser Ser Tyr Tyr Val Phe Tyr He Tyr Val Gly Val Ala Asp 
910 915 920 



2904* 



25 



30 



ACT TTG CTT GCT ATG GGA TTC TTC AGA GGT CTA CCA CTG GTG CAT ACT 2952 
Thr Leu Leu Ala Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr 
925 930 935 940 

CTA ATC ACA GTG TCG AAA ATT TTA CAC CAC AAA ATG TTA CAT TCT GTT 3000 
Leu He Thr Val Ser Lys He Leu His His Lys Met Leu His Ser Val 
945 950 955 

CTT CAA GCA CCT ATG TCA ACC CTC AAC ACQ TTG AAA GCA GGT GGG ATT 3048 
Leu Gin Ala Pro Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly He 
960 965 970 



CTT AAT AGA TTC TCC AAA GAT ATA GCA ATT TTG GAT GAC CTT CTG CCT 
35 Leu Asn Arg Phe Ser Lys Asp lie Ala He Leu Asp Asp Leu Leu Pro 
975 " 980 985 



3096 



40 



CTT ACC ATA TTT GAC TTC ATC CAQ TTG TTA TTA ATT G^G ATT GGA GCT 
Leu Thr He Phe Asp Phe He Gin Leu Leu Leu He Val He Gly Ala 

990. 995 1000 



3144 



45 



50 



ATA GCA GTT GTC GCA GTT TTA CAA CCC TAC ATC TTT GTT GCA ACA GTG 3192 
He Ala Val Val Ala Val Leu Gin Pro Tyr He Phe Val Ala Thr Val 
1005 1010 1015 1020 

CCA GTG ATA GTG GCT TTT ATT ATG TTG AGA GC* TAT TTC CTC CAA ACC J240 
Pro Val Ue Val Ala Phe Her Met Leu Arg Ala Tyr Phe Leu Gin Thr 
1025 # 1030 * 1035 

TCA CAG CAA CTC AAA CAA CTG GAA TCT GAA GGC AGG AGT CCA ATT TTC 3288 
Ser Gin Gin Leu Lys Gin Leu Glu Ser Glu Gly Arg Ser Pro He Phe 
1040 1045 1050 



55 



ACT CAT CTT GTT ACA AGC TTA AAA GGA CTA TGG ACA CTT CGT GCC TTC 
Thr His Leu Val Thr Ser Leu Lys Gly Leu Trp Thr Leu Arg Ala Phe 
1055 1060 1065 



3336 
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GGA COG CAG CCT TAC TTT GAA ACT CTG TTC CAC AAA OCT CTG AAT TTA 3364 

Gly Arg Gin Pro Tyr Phe Glu Thr Leu Phe His Lys Ala Leu Asn Leu 
1070 1075 1080 

5 

CAT ACT GCC AAC TOG TTC TTG TAC CTG TCA ACA CTG CGC TGG TTC CAA 3432 

His Thr Ala Asn Trp Phe Leu Tyr Leu Ser Thr Leu Arg Trp Phe Gin 
1085 1090 1095 1100 

10 ATG AGA ATA GAA ATG ATT TTT GTC ATC TTC TTC ATT GCT GTT ACC TTC 3480 
Met Arg He Glu Met He Phe Val He Phe Phe He Ala Val Thr Phe 
1105 1110 1115 

ATT TCC ATT TTA ACA ACA GGA GAA GGA GAA GGA AGA GTT GGT ATT ATC 3528 
15 He Ser He Leu Thr Thr Gly Glu Gly Glu Gly Arg Val Gly He He 
1120 # 1125 1130 

CTG ACT TTA GCC ATG AAT ATC ATG AGT ACA TTG CAG TGG GCT GTA AAC 3576 
Leu Thr Leu Ala Met Asn He Met Ser Thr Leu Gin Trp Ala Val Asn 
20 1135 1140 1145 * 

TCC AGC ATA GAT GTG GAT AGC TTG ATG CGA TCT GTG AGC CGA GTC TTT 3624 
Ser Ser He Asp Val Asp Ser Leu Met Arg Ser Val Ser Arg Val Phe 
1150 1155 1160 

25 

AAG TTC ATT GAC ATG CCA ACA GAA GGT AAA CCT ACC AAG TCA ACC AAA 3672 
Lys Phe He Asp Met Pro Thr Giu Gly Lys Pro Thr Lys Ser Thr Lys 
1165 1170 1175 1180 

30 CCA TAC AAG AAT GGC CAA CTC TCG AAA GTT ATG ATT ATT GAG AAT TCA 3720 
Pro Tyr Lys Asn Gly Gin Leu Ser Lys Val Met He He Glu Asn Ser 
1185 1190 1195 

CAC GTG AAG AAA GAT GAC ATC TGG CCC TCA GGG GGC CAA ATG ACT GTC 3768 
35 His Val Lys Lys Asp Asp He Trp Pro Ser Gly Gly Gin Met Thr Val 
1200 1205 1210 

AAA GAT CTC ACA GCA AAA TAC ACA GAA GGT GGA AAT GCC ATA TTA GAG 3816 
f*ys Asp Leu Thr Ala Lys Tyr Thr Glu Gly Gly Asn Ala He Leu Glu 
40 \ 1215 1220 .1225 

AAC ATT TCC TTC TCA ATA AGT CCT GGC CAG AGG GTG GGC CTC TTG GGA 3864 
Asn He Ser Phe Ser He Ser Pro Gly Gin Arg Val Gly Leu Leu Gly 
1230 1235 1240 

45 

AGA ACT GGA TCA GGG* AAG AST ACT TTG TTA TCA, GCT TTT TTG AGA CTA 3912 
Arg Thr Gly Ser Gly Lys Ser Thr Leu Leu Ser Ala Phe Leu Arg Leu 
1245 * 1250 . 1255. 1260 

50 CTG AAC ACT GAA GGA GAA ATC CAG ATC GAT GGT GTG TCT TGG GAT TCA 3960 
Leu Asn Thr Glu Gly Glu He Gin He Asp Gly Val Ser Trp Asp Ser 
1265 1270 1275 

ATA ACT TTG CAA CAG TGG AGG AAA GCC TTT GGA GTG ATA CCA CAG AAA 4008 
55 He Thr Leu Gin Gin *^rp Arg Lys Ala Phe Gly Val He Pro Gin Lys 
1280 1285 1290 
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GTA TTT ATT TTT TCT GGA ACA TTT AGA AAA AAC TTG GAT CCC TAT GAA 4056 
Val Phe lie Phe Ser Gly Thr Phe Arg Lys Asn Leu Asp Pro Tyr Glu 
1295 1300 1305 

CAG TOG AGT GAT CAA GAA ATA TGG AAA GTT GCA GAT GAG GTT GGG CTC 4104 
Gin Trp Ser Asp Gin Glu lie Trp Lye Val Ala Asp Glu Val Gly Leu 
1310 1315 1320 



10 AGA TCT GTG ATA GAA CAG TTT CCT GGG AAG CTT GAC TTT GTC CTT GTG 4152 
Arg Ser Val lie Glu Gin Phe Pro Gly Lys Leu Asp Phe Val Leu Val 
1325 1330 1335 1340 

GAT GGG GGC TGT GTC CTA AGC CAT GGC CAC AAG CAG TTG ATG TGC TTG 4200 
15 Asp Gly Gly Cys Val. Leu Ser His Gly His Lys Gin Leu Met Cys Leu 

1345 1350 1355 

GCT AGA TCT GTT CTC AGT AAG GCG AAG ATC TTG CTG CTT GAT GAA CCC 4248 
Ala Arg Ser Val Leu Ser Lys Ala Lys He Leu Leu Leu Asp Glu Pro 
20 1360 1365 1370 



AGT GCT CAT TTG GAT CCA GTA ACA TAC CAA ATA ATT AGA AGA ACT CTA 4296 
Ser Ala His Leu Asp Pro Val Thr Tyr Gin He He Arg Arg Thr Leu 
1375 1380 1385 

25 

AAA CAA GCA TTT GCT GAT TGC ACA GTA ATT CTC TGT GAA CAC AGG ATA 4344 
Lys Gin Ala Phe Ala Asp Cys Thr Val He Leu Cys Glu His Arg He 
1390 1395 1400 

30 GAA GCA ATG CTG GAA TGC CAA CAA TTT TTG GTC ATA GAA GAG AAC AAA 4392 
Glu Ala Met Leu Glu Cys Gin Gin Phe Leu Val He Glu Glu Asn Lys 
1405 4 1410 1415 1420 

GTG COG CAG TAC GAT TCC ATC CAG AAA CTG CTG AAC GAG AGG AGC CTC 4440 
35 Val Arg Gin Tyr Asp Ser He Gin Lys Leu Leu Asn Glu Arg Ser Leu 

' 1425 1430 1435 

TTC CGG CAA GCC ATC AGC CCC TCC GAC AGG GTG AAG CTC TTT CCC CAC 4488 
Phe Arg Gin Ala He Ser Pro Ser Asp Arg val Lys Leu Phe Pro His 
40 1440 1445 1450 

CGG AAC TCA AGC AAG TGC AAG TCT AAG CCC CAG ATT GCT GCT CTG AAA 4536 
Arg Asn Ser Ser Lys Cys Ly« Ser Lys Pro Gin He Ala Ala Leu Lys 
1455 1460 1465 

45 

GAG GAG ACA GAA GAA GAG GTG CAA GAT ACA AGG CTT TAGAGAGCAG 4582 
Glu. Glu Thr Qlu Glu Glu Val Gin Asp Thr Arg Leu 
1470 ; * 1475 1480 

50 CATAAATGTT GACATGGGAC ATTTGCTCAT GGAATTGGAG CTCGTGGGAC AGTCACCTCA 4642 

TGGAATTGGA GCTCGTGGAA CAGTTACCTC TGCCTCAGAA AACAAGGATG AATTAAGTTT 47 02 

TTTTTTAAAA AAGAAACATT TGGTAAGGGG AATTGAGGAC ACTGATATGG. GTCTTGATAA 4762 

55 

ATGGCTTCCt GGCAATAGTC AAATTGTGTG AAAGGTACTT CAAATCCTTG AAGATTTACC 4822 



ACTTGTGTTT TGCAAGCCAG ATTTTCCTGA AAACCCTTGC CATGTGCTAG TAATTGdftl 4882 
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GGCAGCTCTA AATGTCAATC AGCCTAGTTG ATCAGCTTAT TGTCTAGTGA AACTCGTTAA 494 2 

TTTGTAGTGT TGGAGAAGAA CTGAAATCAT ACTTCTTAGG GTTATGATTA AGTAATGATA 5002 

5 ACTGGAAACT TCAGCGGTTT ATATAAGCTT GTATTCCTTT TTCTCTCCTC TCCCCATGAT 5062 

GTTTAGAAAC ACAACTATAT TGTTTGCTAA GCATTCCAAC TATCTCATTT CCAAGCAAGT 5122 

10 ATTAGAATAC CACAGGAACC ACAAGACTGC ACATCAAAAT ATGCCCCATT CAACATCTAG 51B2 

TGAGCAGTCA GGAAAGAGAA CTTCCAGATC CTGGAAATCA GGGTTAGTAT TGTCCAGGTC 5242 

TACCAAAAAT CTCAATATTT CAGATAATCA CAATACATCC CTTACCTGGG AAAGGGCTGT 53 02 

15 TATAATCTTT CACAOGGGAC AGGATGGTTC CCTTGATGAA OAAGTTGATA TGCCTTTTCC 53 62 

CAACTCCAGA AAGTGACAAG CTCACAGACC TTTGAACTAG AGTTTAOCTG GAAAAGTATG " 5422 

20 TTAGTGCAAA TTGTCACAGG ACAGCCCTTC TTTCCACAOA AGCTCCAGGT AGAGGGTGTG 5482 

TAAGTAGATA GGCCATGGGC ACTGTGGGTA GACACACATG AAGTCCAAGC ATTTAGATGT 5542 

ATAGGTTGAT GGTGGTATGT TTTCAGOCTA GATGTATGTA CTTCATGCTG TCTACACTAA 5602 

25 GAGAGAATGA GAGACACACT OAAGAAGCAC CAATCATGAA TTAGTTTTAT ATGCTTCTGT 5662 

TTTATAATTT TGTGAAGCAA AATTTTTTCT CTAQOAAATA TTTATTTTAA TAATGTTTCA 5722 

30 AACATATATT ACAATOCTOT ATTTTAAAAG AATGATTATG AATTACATTT GTATAAAATA 5782 

ATTTTTATAT TTGAAATATT OACTTTTTAT GGCACTAOTA TTTTTATOAA ATATTATGTT 5842 

AAAACTGGGA CAGGGGAGAA CCTAQGGTOA TATTAACCAG GGGCCATGAA TCACCTTTTO 5902 

35 GTCTGGAGGG AAGCCTTOOG GCTGATCGAG TTQTTGCCCA CAGCTGTATG ATTCCCAGCC 5962 

AGACACAGCC TCTTAGATGC AGTTCTOAAG AAOATGGTAC CACCAGTCTG ACTGTTTCCA 6022 

TCAAGQGTAC ACTGCCTTCT CAACTCCAAA CTGACTCTTA AOAAGACTGC ATTATAtTTA 6082 

TTACTGTAAG AAAATATCAC TTGTCAATAA AATCCATACA TTTUTOT " S129 



40 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: :; 

(A) LENGTH: 1480 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

55 Met Gin Arg Ser Pro Leu Glu Lya Ala Ser Val yal Ser Lys Leu Phe 

1 5 10 * is • 



45 



50 
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10 



Phe Ser Tip Thr Arg Pro lie Leu Arg Lys Gly Tyr Arg Gin Arg Leu 

20 25 v 30 

Glu Leu Ser Asp lie Tyr Gin lie Pro Ser Val Asp Ser Ala Asp Asn 

35 40 45 

Leu Ser Glu Lys Leu Glu Arg Glu Trp Asp Arg Glu Leu Ala Ser Lys 

50 55 60 

Lys Asn Pro Lys Leu lie Asn Ala Leu Arg Arg Cys Phe Phe Trp Arg 

65 70 75 80 



Phe Met Phe Tyr Gly lie Phe Leu Tyr Lou Gly Glu Val Thr Lys Ala 
15 85 90 95 

Val Gin Pro Leu Leu Leu Gly Arg lie lie Ala Ser Tyr Asp Pro Asp 
100 105 110 

20 Asn Lys Glu Glu Arg Ser He Ala He Tyr Leu Gly lie Gly Leu Cys 
115 120 125 



25 



Leu Leu Phe He Val Arg Thr Leu Leu Leu His Pro Ala He Phe Gly 

130 135 140 

Leu His His He Gly Met Gin Net Arg He Ala Met Phe Ser Leu He 

145 150 155 160 



Tyr Lys Lys Thr Leu Lys Leu Ser Ser Arg Val Leu Asp Lys He Ser 

30 165 170 175 

He Gly Gin Leu Val Ser Leu Leu Ser Asn Asn Leu Asn Lys Phe Asp 
180 185 190 

35 Glu Gly Leu Ala Leu Ala His Phe Val Trp He Ala Pro Leu Gin Val 
195 200 205 



40 



Ala Leu Leu Met Gly Leu He Trp Glu Leu Leu Gin Ala Ser Ala Phe 
210 21S 220 

Cys Gly Leu Gly Phe Leu He Val Leu Ala Leu Phe Gin Ala Gly Leu 

225 230 235 240 



Gly Arg Met Met Met Lys Tyr Arg Asp Gin Arg Ala Gly Lys He Ser 

45 245 250 255 

Glu Arg Leu Val He Thr Ser Glu Met He <^.u Asn He Gin Ser Val 
260 '* 265 270 

50 Lys Ala Tyr Cys Trp Glu Glu Ala Met Glu Lys Met He Glu Asn Leu 

275 280 285 



55 



Arg Gin Thr Glu Leu Lys Leu Thr Arg Lys Ala Ala Tyr Val Arg Tyr 
290 295 300 

Phe Asn Ser Ser Ala Phe Phe Phe Ser Gly Phe Phe Val Val Phe Leu 

305 310 315 320 
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Ser Val Leu Pro Tyr Ala Leu He Lys Gly lie He Leu Arg Lye He 
325 330 335 

Phe Thr Thr He Ser Phe Cys He Val Leu Arg Met Ala Val Thr Arg 
340 345 350 

Gin Phe Pro Trp Ala Val Gin Thr Trp Tyr Asp Ser Leu Gly Ala He 
355 360 365 

Asn Lys He Gin Asp Phe Leu Gin Lys Gin Glu Tyr Lys Thr Leu Glu 
370 375 380 



Tyr Asn Leu Thr Thr Thr Glu Val Val Met Glu Asn Val Thr Ala Phe 
15 385 390 395 40D 



20 



Trp Glu Glu Gly Phe Gly Glu Leu Phe Glu Lys Ala Lys Gin Asn Asn 
405 410 415 

* 

Asn Asn Arg Lys Thr Ser Asn Gly Asp Asp Ser Leu Phe Phe Ser Asn 
420 425 430 



25 



Phe Ser Leu Leu Gly Thr Pro Val Leu Lys Asp He Asn Phe Lys He 
435 440 445 

Glu Arg Gly Gin Leu Leu Ala Val Ala Gly Ser Thr Gly Ala Gly Lys 
450 455 460 



Thr Ser Leu Leu Met Met He Met Gly Glu Leu Glu Pro Ser Glu Gly 
30 465 470 475 480 



35 



Lys He Lys His Ser Gly Arg He Ser Phe Cys Ser Gin Phe Ser Trp 
485 490 495 

He Met Pro Gly Thr He Lys Glu Asn He He Phe Gly Val Ser Tyr 
500 505 510 



40 



Asp Glu Tyr Arg Tyr Arg Ser Val He Lys Ala Cys Gin Leu Glu Glu 
515 520 525 

Asp He Ser Lys Phe Ala Glu Lys Asp Asn He Val Leu Gly Glu Gly 
530 535 540 



Gly He Thr Leu Ser Gly Gly Gin Arg Ala Arg He Ser Leu Ala Arg 
45 545 550 555 560 

Ala Val Tyr Lys ^sp Ala Asp Leu Tyr Leu Leu Asp Ser Pro Phe Gly * 
* • 565* " 570 575 * 

50 Tyr Leu Asp Val Leu Thr Glu Lys Glu He Phe Glu Ser Cys Val Cys 
580 585 590 



55 



Lys Leu Met Ala Asn Lys Thr Arg He Leu Val Thr Ser Lys Met Glu 
595 600 605 

His Leu Lys Lys Ala Asp Lys He Leu He Leu His Glu Gly Ser Ser 

610 615 620 
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Tyr Phe Tyr Gly Thr Phe Ser Glu Leu Gin Asn Leu Gin Pro Asp Phe 

625 630 635 640 

5 Ser Ser Lys Leu Met Gly Cys Asp Ser Phe Asp Gin Phe Ser Ala Glu 

645 650 655 



Arg Arg Asn Ser He Leu Thr Glu Thr Leu His Arg Phe Ser Leu Glu 
660 665 670 

10 

Gly Asp Ala Pro Val Ser Trp Thr Glu Thr Lys Lys Gin Ser Phe Lys 
675 680 685 

Gin Thr Gly Glu Phe Gly Glu Lys Arg Lys Asn Ser lie Leu Asn Pro 
15 690 695 « 700 

lie Asn Ser He Arg Lys Phe Ser He Val Gin Lys Thr Pro Leu Gin 
705 710 715 720 

20 Met Asn Gly lie Glu Glu Asp Ser Asp Glu Pro Leu Glu Arg Arg Leu 

725 730 735 

Ser Leu Val Pro Asp Ser Glu Gin Gly Glu Ala lie Leu Pro Arg He 
740 745 750 

25 

Ser Val He Ser Thr Gly Pro Thr Leu Gin Ala Arg Arg Arg Gin Ser 
755 760 765 



Val Leu Asn Leu Met Thr His Ser Val Asn Gin Gly Gin Asn He His 

30 770 775 780 

Arg Lys Thr Thr Ala Ser Thr Arg Lys Val Ser Leu Ala Pro Gin Ala 

785 790 795 800 

35 Asn Leu Thr Glu Leu Asp He Tyr Ser Arg Arg Leu Ser Gin Glu Thr 

805 810 815 



40 



Gly Leu Glu He Ser Glu Glu He Asn Glu Glu Asp Leu Lys Glu Cys 
820 825 830 

Leu Phe Asp Asp Met Glu Ser He Pro Ala Val Thr Thr Trp Asn Thr 
835 840 845 



Tyr Leu Arg Tyr He Thr Val His Lys Ser Leu He Phe Val Leu He 
45 850 855 860 

Trp Cys Leu Val He Phe Leu Ala Glu Val Ala Ala Ser Leu Val Val 
865 870 875 880 

50 Leu Trp Leu Leu Gly Asn Thr Pro Leu Gin Asp Lys Gly Asn Ser Thr 

885 890 895 



55 



His Ser Arg Asn Asn Ser Tyr Ala Val He He Thr Ser Thr Ser Ser 
900 905 910 

Tyr Tyr Val Phe Tyr He Tyr val Gly Val Ala Asp Thr Leu Leu Ala 
915 920 925 
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Met Gly Phe Phe Arg Gly Leu Pro Leu Val His Thr Leu He Thr Val 
930 935 940 

5 Ser Lys He Leu His His Lys Met Leu His Ser Val Leu Gin Ala Pro 
945 950 955 960 

Met Ser Thr Leu Asn Thr Leu Lys Ala Gly Gly He Leu Asn Arg Phe 

965 970 975 

10 

Ser Lys Asp He Ala He Leu Asp Asp Leu Leu Pro Leu Thr He Phe 
980 985 990 

Asp Phe He Gin Leu Leu Leu He Val He Gly Ala He Ala Val Val 
15 995 1000 1005 

Ala Val Leu Gin Pro Tyr He Phe Val Ala Thr Val Pro Val He Val 
1010 1015 1020 

20 Ala Phe He Met Leu Arg Ala Tyr Phe Leu Gin Thr Ser Gin Gin Leu 
1025 1030 1035 1040 

Lys Gin Leu Glu Ser Glu Gly Arg Ser Pro He Phe Thr His Leu Val 
1045 1050 1055 

25 

Thr Ser Leu Lys Gly Leu Trp Thr Leu Arg Ala Phe Gly Arg Gin Pro 
1060 1065 1070 

Tyr Phe Glu Thr Leu Phe His Lys Ala Leu Asn Leu His thr Ala Asn 

30 1075 1080 1085 

Trp Phe Leu Tyr Leu Ser Thr Leu Arg Trp Phe Gin Met Arg He Glu 
1090 1095 HOO 

35 Met He Phe Val He Phe Phe He Ala Val Thr Phe He Ser He Leu 
1105 1110 H15 H20 

Thr Thr Gly Glu Gly Glu Gly Arg Val Gly He He Leu Thr Leu Ala 
1125 H30 H35 

40 

Met Asn He Met Ser Thr Leu Gin Trp Ala Val Asn Ser Ser He Asp 
1140 H45 H50 

Val Asp Ser Leu Met Arg Ser Val Ser Arg Val Phe Lys Phe He Asp 
45 1155 1160 H65 

Met Pro Thr Glu Gly Lys Pro Thr Lys Ser Thr Lys Pro Tyr Lys Asn 
1170 1175 H80 

50 Gly Gin Leu Ser Lys Val Met He He Glu Asn Ser His Val Lys Lys 
1185 1190 H95 1200 

Asp Asp He Trp Pro Ser Gly Gly Gin Met Thr Val Lys Asp Leu Thr 
1205 1210 1215 

Ala Lys Tyr Thr Glu Gly Gly Asn Ala He Leu Glu Asn He Ser Phe 
1220 1225 1230 
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Ser lie Ser Pro Gly Gin Arg Val Gly Leu Leu Gly Arg Thr Gly Ser 
1235 1240 1245 

5 Gly Lys Ser Thr Leu Leu Ser Ala Phe Leu Arg Leu Leu Asn Thr Glu 
1250 1255 1260 

Gly Glu lie Gin lie Asp Gly Val Ser Trp Asp Ser lie Thr Leu Gin 
1265 1270 1275 1280 

10 

Gin Trp Arg LyB Ala Phe Gly Val lie Pro Gin Lys Val Phe He Phe 
1285 1290 1295 

Ser Gly Thr Phe Arg Lys Asn Leu Asp Pro Tyr Glu Gin Trp Ser Asp 
15 1300 1305 1310 

Gin Glu He Trp Lys Val Ala Asp Glu Val Gly Leu Arg Ser Val He 
1315 1320 1325 

20 Glu Gin Phe Pro Gly Lys Leu Asp Phe Val Leu Val Asp Gly Gly Cys 
1330 1335 1340 

Val Leu Ser His Gly His Lys Gin Leu Met Cys Leu Ala Arg Ser Val 
1345 1350 1355 1360 

25 

Leu Ser Lys Ala Lys He Leu Leu Leu Asp Glu Pro Ser Ala His Leu 
1365 1370 1375 

Asp Pro Val Thr Tyr Gin He He Arg Arg Thr Leu Lys Gin Ala Phe . 
30 1380 1385 1390 

Ala Asp Cys Thr Val He Leu Cys Glu His Arg He Glu Ala Met Leu 
1395 1400 1405 

35 Glu Cys Gin Gin Phe Leu Val He Glu Glu Asn Lys Val Arg Gin Tyr 
1410 1415 1420 

Asp Ser He Gin Lys Leu Leu Asn Glu Arg Ser Leu Phe Arg Gin Ala 
1425 1430 1435 1440 

40 

He Ser Pro Ser Asp Arg Val Lys Leu Phe Pro His Arg Asn Ser Ser 
1445 1450 1455 

Lys Cys Lys Ser Lys Pro Gin He Ala Ala Leu Lys Glu Glu Thr Glu 
45 1460 1465 1470 

Glu Glu Val Gin Asp Thr Arg Leu 
1475 1480 

50 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 5 base pairs 

(B) TYPE : nucleic acid 
55 (C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



i (ii) MOLECULE TYPE: cDNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 6 0 

TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGGAAGTGT 120 

GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGCCGGATG TGGTAAAAGT GACGTTTTTG 180 

GTGTGCGCCG GTGTATACGG GAAGTGACAA TTTTCGCGCG GTTTT AGGCG GATGTTGTAG 240 

TAAATTTGGG CGTAACCAAG TAATGTTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 300 

AGTGAAATCT GAATAATTCT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 360 

GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCCTTC 4^p 

* 

CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG CGCAGTGTAT TTATACCCGG 480 

TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 540 

TCCGAGCTAG TAACGGCCGC CAGTGTGCTG CAGATATCAA AGTCGACGGT ACCCGAGAGA 600 

CCATGCAGAG GTCGCCTCTG GAAAAGGCCA GCGTTQTCTC CAAACTTTTT TTCAGCTGGA 660 

CCAGACCAAT TTTGAGGAAA GGATACAGAC AQCGCCTGGA ATTGTCAGAC ATATACCAAA 720 

TCCCTTCTST TGATTCTGCT GACAATCTAT CTGAAAAATT GGAAAOAGAA TGGGATAGAG 780 

AGCTGGCTTC AAAOAAAAAT CCTAAACTCA TTAATGCCCT TCGGCGATGT TTTTTCTOGA 840 

GATTTATGTT CTATGGAATC TTTTTATATT TAGGOOAAGT CACCAAAGCA GTACAOCCTC 900 

TCTTACTQQQ AAOAATCATA GCTTCCTATG ACCCOGATAA CAAGQAOGAA CGCTCTATCG 960 

CGATTTATCT AGGCATAGGC TTATGCCTTC TCTTTATTGT GAGGACACTG CTCCTACACC 102 0 

CAGCCATTTT TGGCCTTCAT CACATTGGAA TGCAGATGAG AATAGCTATG TTTAGTTTGA 1080 

TTTATAAGAA GACTTTAAAG CTGTCAAOCC GTGTTCTAGA TAAAATAAGT ATTGGACAAC 1140 

TTGTTAGTCT CCTTTCCAAC AACCTGAACA AATTTGATGA AGGACTTGCA TTGGCACATT 1200 

TCGTGTGGAT CGCTCCTTTG CAAGTGGCAC TCCTCATGOG GCTAATCTGG GAGTTGTTAC 1260 

AGGCGTCTGC CTTCTGTGGA CTTGGTTTCC TGATAGTCCT TGCC C T TTTT CAGGCTGGGC 1320 

TAGGGAGAAT GfTGATGAAG TACAGAGATC AGAGAGCTGG GAAGATCAGT GAAAOACTTG ' 1380 

TGATTACCTC AGAAATGATT GAAAACATCC AATCTGTTAA GGCATACTGC TGGGAAGAAG 1440 

CAATGGAAAA AATGATTGAA AACTTAAGAC AAACAOAACT GAAACTGACT CGGAAGGCAG 1500 

CCTATGTGAG ATACTTCAAT AGCTCAQCCT TCTTCTTCTC AGGGTTCTTT GTGGTGTTTT 1560 
TATCTGTGCT TCCCTATQCA CTAATCAAAG GAATCATCCT CCGGAAAATA TTCACCACCA *^?L6 # 20 

TCTCATTCTG CATTGTTCTG CGCATQGCGG TCACTCGGCA ATTTCCCTGG GCTGTACAAA 1680 
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CATGGTATGA CTCTCTTGGA GCAATAAACA AAATACAGGA TTTCTTACAA AAGCAAGAAT 1740 

ATAAGACATT GGAATATAAC TTAACGACTA CAGAAGTAGT GATGGAGAAT GTAACAGCCT 1800 

TCTGGGAGGA GGGATTTGGG GAATTATTTG AGAAAGCAAA ACAAAACAAT AACAATAGAA 1860 

AAACTTCTAA TGGTGATGAC AGCCTCTTCT TCAGTAATTT CTCACTTCTT GGTACTCCTG 1920 

TCCTGAAAGA TATTAATTTC AAGATAGAAA GAGGACAGTT GTTGGCGGTT GCTGGATCCA 1980 

CTGGAGCAGG CAAGACTTCA CTTCTAATGA TGATTATGGG AGAACTGGAG CCTTCAGAGG 2 040 

GTAAAATTAA GCACAGTGGA AGAATTTCAT TCTGTTCTCA GTTTTCCTGG ATTATGCCTG 2100 

GCACCATTAA AGAAAATATC ATCTTTGGTG TTTCCTATGA TGAATATAGA TACAGAAGCG 216 0 

TCATCAAAGC ATGCCAACTA GAAGAGGACA TCTCCAAGTT TGCAGAGAAA GACAATATAG 2220 

TTCTTGGAGA AGGTGGAATC ACACTGAGTG GAGGTCAACG AGCAAGAATT TCTTTAGCAA 228 0 

GAOCAGTATA CAAAGATGCT GATTTGTATT TATTAGACTC TCCTTTTGGA TACCTAGATG 2340 

TTTTAACAGA AAAAGAAATA TTTGAAAGCT GTGTCTOTAA ACTGATGOCT AACAAAACTA 2400 

25 GGATTTTGGT CACTTCTAAA ATGGAACATT TAAAOAAAGC TGACAAAATA TTAATTTTGC 2460 

ATGAAGGTAG CAGCTATTTT TATGGGACAT TTTCAOAACT CCAAAATCTA CAGCCAGACT 2520 

TTAGCTCAAA ACTCATGGOA TGTGATTCTT TCGACCAATT TAGTGCAGAA AGAAQAAATT 2580 

CAATCCTAAC TGAGACCTTA CACCGTTTCT CATTAGAAGG AGATGCTCCT GTCTCCTGGA 2640 

CAGAAACAAA AAAACAATCT TTTAAACAGA CTGGAGAGTT TGGGGAAAAA AGGAAGAATT 2700 

CTATTCTCAA TCCAATCAAC TCTATACGAA AATTTTCCAT TGTGCAAAAG ACTCCCTTAC 2760 

AAATGAATGG CATCGAAGAG GATTCTGATG AGCCTTTAGA GAGAAGGCTG TCCTTAOTAC 2820 

CAOATTCTGA GCAGOGAGAG GCGATACTGC CTCGCATCAG CGTGATCAGC ACTGGCCCCA 2880 

CGCTTCAGGC ACGAAGOAGG CAGTCTGTCC TGAACCTGAT GACACACTCA GTTAACCAAG 2940 

GTCAGAACAT TCACCGAAAG ACAACAGCAT CCACACGAAA AGTGTCACTG GCCCCTCAGG 3000 

CAAACTTGAC TGAACTGGAT ATATATTCAA GAAGGTTATC TCAAGAAACT GGCTTOGAAA 3060 

TAAGTGAAGA " AATTAACGAA GAAGACTTAA AGGAGTGCCT TTTTGATGAT ATGGAGAGCA 3120 
.V. . 

TACCAGCAGT GACTACATGG AACACATACC TTCGATATAT TACTGTCCAC AAGAGCTTAA 3180 

50 

TTTTTGTGCT AATTTGGTGC TTAGTAATTT TTCTGGCAGA GGTGGCTGCT TCTTTGGTTG 3240 

TCCTGTGGCT CCTTGOAAAC ACTCCTCTTC AAGACAAAGG GAATAGTACT CATAGTAGAA 3300 

55 ATAACAGCTA TGCAGTOATT ATCACCAGCA CCAGTTCGTA TTATGTGTTT TACATTTACG 3360 

TGGGAGTAGC CGACACTTTG CTTGCTATGG GATTCTTCAG AGGTCTACCA CTGGTGCATA 3420 

CTCTAATCAC AGTGTCGAAA ATTTTACACC ACAAAATGTT ACATTCTGTT CTTCAAGCAC 34 80 
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CTATGTCAAC CCTCAACACG TTGAAAGCAG GTGGGATTCT TAATAGATTC TCCAAAGATA 3540 
TAGCAATTTT GGATGACCTT CTGCCTCTTA CCATATTTGA CTTCATCCAG TTGTTATTAA 3600 
TTGTGATTGG AGCTATAGCA GTTGTCGCAG TTTTACAACC CTACATCTTT GTTGCAACAa 3660 
TGCCAGTGAT AGTGGCTTTT ATTATGTTGA GAGCATATTT CCTCCAAACC TCACAGCAAC 3720 

TCAAACAACT GGAATCTGAA GGCAGGAGTC CAATTTTCAC TCATCTTGTT ACAAGCTTAA 3780 

AAGGACTATG GACACTTCGT GCCTTCGGAC GGCAGCCTTA CTTTGAAACT CTGTTCCACA 384 0 

AAGCTCTGAA TTTACATACT GCCAACTGGT TCTTGTACCT GTCAACACTG CGCTGGTTCC 3 900 

AAATGAGAAT AGAAATGATT TTTGTCATCT TCTTCATTGC TGTTACCTTC ATTTCCATTT 3 960 

TAACAACAGG AGAAGGAGAA GGAAOAGTTG GTATTATCCT GACTTTAGCC ATOAATATCA 4020 

TOAGTACATT GCAGTGGGCT GTAAACTCCA GCATAGATGT GGATAGCTTG ATGCGATCTG 4080 

TGAGCCGAGT CTTTAAOTTC ATTGACATGC CAACAGAAGG TAAACCTACC AAOTCAACCA 4140 

AACCATACAA GAATGGCCAA CTCTCGAAAG TTATGATTAT TGAGAATTCA CACGTCAAGA 4200 

AAGATGACAT CTOGCCCTCA GGGGGCCAAA TGACTGTCAA AGATCTCACA GCAAAATACA 4260 

CAGAAGGTGG AAATGCCATA TTAGAGAACA TTTCCTTCTC AATAAGTCCT GGCCAGAGGG 4320 

TOGGCCTCTT GGGAAOAACT GGATCAGGGA AOAOTACTTT GTTATCAGCT TTTTTOAOAC 4380 

TACTGAACAC TOAAOOAOAA ATCCAGATCG ATOGTOTOTC TTGGOATTCA ATAACTTTGC 4440 

AACAGTQGAG GAAAflCCTTT GOAGTGATAC CACAOAAAOT ATTTATTTTT TCTGGAACAT 4500 

TTAGAAAAAA CTTGGATCCC TATGAACAGT GGAGTGATCA AGAAATATOG AAAGTTGCAG 4560 

ATGAGGTTOG GCTCAOATCT GTGATAGAAC AGTTTCCTGG GAAGCTTGAC TTTGTCCTTG 4620 

TGOATGGGGG CTGTGTCCTA AGCCATGGCC ACAAGCAflTT OATGTGCTTG GCTAGATCTG 4680 

TTCTCAGTAA GGCOAAGATC TTOCTGCTTG ATGAACCCAG TGCTCATTTG GATCCAGTAA 4740 

CATACCAAAT AATTAGAAGA ACTCTAAAAC AAGCATTTGC TGATTGCACA GTAATTCTCT 4800 

GTGAACACAG GATAGAAGCA ATGCTGGAAT GCCAACAATT TTTGGTCATA GAAGAGAACA 4860 

AAGTGCQGCA GTACGATTCC ATCCAOAAAC TGCTGAACGA GAGGAGCCTC TTCCGGCAAG *920 

CCATCAGCCC CTCCGACAGG GTGAAGCTCT TTCCCCACCG GAACTCAAGC AAGTGCAAGT 4980 

CTAAGCCCCA GATTGCTGCT CTGAAAGAGG AGACAGAAGA AGAGGTGCAA GATACAAGGC 5040 

TTTAOAGAGC AGCATAAATG TTGACATOGG ACATTTQCTC ATOGAATTGG AGGTAGCGGA 5100 

TTGAGGTACT GAAATGTGTG GGCGTGGCTT AAQGGTGGGA AAGAATATAT AAGGTGGGGG 5160 

TCTCATGTAG TTTTGTATCT GTTTTGCAGC AGCCGCCGCC ATGAGCGCCA ACTCGTTTGA 522 0 
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TGGAAGCATT GTGAGCTCAT ATTTGACAAC GCGCATGCCC CCATGGGCCG GGGTCCGTCA " 52 8 0 

GAATQTGATG GGCTCCAGCA TTGATGGTCG CCCCGTCCTG CCCGCAAACT CTACTACCTT 534 0 

) GACCTACGAG ACCGTQTCTG GAACGCCGTT GGAGACTGCA GCCTCCGCCG CCOCTTCAGC 5400 

CGCTQCAQCC ACCGCCCQCG GGATTGTGAC TGACTTTGCT TTCCTGAGCC CGCTTGCAAG 5460 

CAGTGCAGCT TCCCGTTCAT CCGCCCGCGA TGACAAGTTG ACGGCTCTTT TGGCACAATT 5520 

GGATTCTTTG ACCCGGGAAC TTAATGTCGT TTCTCAGCAG CTGTTGGATC TGCGCCAGCA 5580 

GGTTTCTGCC CTGAAGGCTT CCTCCCCTCC CAATGCGGTT TAAAACATAA ATAAA 5635 

15 f2) INFORMATION FOR SBQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
(B} TYPB: nucleic acid 
20 (C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



10 



25 



30 



40 



<ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO:4: 
ACTCTTOAOT GCCAGCGAGT AGAGTTTTCT CCTCCG 36 
(2) INFORMATION FOR SEQ ID NO:5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH; 29 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
45 GCAAAOGAGC OATCCACACG AAATGTGCC 2 9 

(2) INFORMATION FOR SEQ ID NO:6: 



* (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



30 



35 



# 
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CTCCTCCGAG CCGCTCCGAG CTAG 

(2) INFORMATION FOR SEQ ID NO: 7: 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CCAAAAATGG CTGGGTGTAG GAGCAGTGTC C 
20 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 

(Xi) SEQUENCE INSCRIPTION: SEQ ID NO: 8: 
CGGATCCTTT ATTATAOOGG AAGTCCACGC CTAC 
(2) INFORMATION FOR SEQ ID NO:9: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



■45 



(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
50 CGGGATCCAT CGATGAAATA TGACTACGTC CG 
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Claims 

1 . An adenovirus-based gene therapy vector comprising the genome of an adenovirus 2 
serotype in which the Ela and Elb regions of the genome, which are involved in early stages 

5 of viral replication, have been deleted and replaced by genetic material of interest. 

2. The adenovirus-based gene therapy vector of claim 1, wherein the genetic material of 
interest is DNA encoding cystic fibrosis transmembrane conductance regulator 

10 3. The adenovirus-based gene therapy vector of claim 1 further comprising PGK 
promoter operably linked to the genetic material of interest. 

4. The adenovirus-based gene therapy vector of claim 2 having substantially the same 
nucleotide sequence as shown in Table II (SEQ ID NO:3). 

15 

5 . An adenovirus-based gene therapy vector comprising adenovirus inverted terminal 
repeat nucleotide sequences and the minimal nucleotide sequences necessary for efficient 
replication and packaging and genetic material of interest 

20 6. The adenovirus-based gene therapy vector of claim 5 having the adenovirus 2 
sequences shown in Figure 17. 

7. The adenovirus-based gene therapy vector of claim 5 further comprising PGK 
* promoter operably linked to the genetic material of interest % 

25 

8. The adenovirus-based gene therapy vector of claim 5 in which the genetic material of 
interest is selected from the group consisting of DNA encoding: cystic fibrosis 
transmembrane conductance regulator, Factor Vm, and Factor DC. 

30 9. An adenovirus-based gene therapy vector comprising an adenovirus genome which 
has been deleted for all E4 open reading frames, except open reading frame 6, ai^i 
additionally comprising genetic material of interest » 

1 0. The adenovirus-based gene therapy vector of claim 9 further comprising PGK 
35 promoter operably linked to the genetic material of intd^t 

1 1 . The adenovirus-based gene therapy vector of claim 9 in which the Ela and Elb regions 
of the genome, which are involved in early stages of viral replication, have been deleted. 
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12. The adenovirus-based gene therapy vector of claim 9 in which the E3 region has been 
deleted. 

13. An adenovims-based gene therapy vector comprising an adenovirus genome which 
5 has been deleted for all E4 open reading frames, except open reading frame 3, and 

additionally comprising genetic material of interest. 

1 4. The adenovims-based gene therapy vector of claim 1 3 in which the Ela and Elb 
regions of the genome, which arc involved in early stages of viral replication, have been 

la deleted. 

» ♦ ■ 

1 5. The adenovirus-based gene therapy vector of claim 13 further comprising PGK 
promoter operably linked to the genetic material of interest 

IS 16. The adenovirus-based gene therapy vector of claim 13 in which the E3 region has 
been deleted. 

1 7. A method for treating or preventing cystic fibrosis in a patient comprising 
administering to the pulmonary airways of the patient, a gene therapy vector comprising 

20 DNA encoding cystic fibrosis transmembrane conductance regulator. 

18. The method of claim 17 wherein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising the genome of an adenovirus 2 serotype in which die Ela and Elb 
regions of die genome, which are involved in early stages of viral replication, have been 

25 deleted and replaced by DNA encoding cystic fibrosis transmembrane conductance 
regulator. 

19. The method of claim 17 wherein the gene therapy vector further comprises PGK 
promoter operably linked to die DNA encoding cystic fibrosis tran sm embrane conductance 

30 regulator. 

20. The method of claim 1 7 wherein the gene therapy vector is 4n adenovirus-based gene 
therapy vector comprising adenovirus inverted terminal repeats and the minimal sequences 
necessary for efficient replication and packaging and DNA encoding cystic fibrosis 

35 tranmembrane conductance regulator. 

2 1 . The method of claim 20 wherein the gene therapy vector further comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 



regulator. 
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22. The method of claim 1 7 wherein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising an adenovirus genome which has been deleted for all E4 open 
reading frames, except open reading frame 6, and additionally comprising DNA encoding 
5 cystic fibrosis transmembrane conductance regulator. 



23. The method of claim 22 wherein the gene therapy vector further comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 
regulator. 

24. The method of claim 1 7 wherein the gene therapy vector is an adenovirus-based gene 
therapy vector comprising an adenovirus genome which has been deleted for all E4 open 
reading frames, except open reading frame 6, and has been deleted for die Ela and Elb regions 
of the genome, which are involved in early stages of viral replication, and additionally 

1 5 comprising DNA encoding cystic fibrosis tranmembrane conductance regulator. 

25. The method of claim 24 wherein the gene therapy vector further comprises PGK 
promoter operably linked to the DNA encoding cystic fibrosis transmembrane conductance 
regulator. 
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